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(57) Abstract 

A mass spcctrometric method for sequencing nucleic acids using RNA polymerases, including DNA-dcpcndent and RNA-depcndent 
RNA polymerases, is provided. The methods include use the RNA polymenLse is to generate a set of nested RNA transcripts, which are 
analyzed mass spectrometry. A method of identifying transcriptional terminator sequences or attenuator sequences is also provided. 
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MASS SPECTROMETRIC METHODS FOR SEQUENCING NUCLEIC ACIDS 
RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. application Serial 

^; Nb.08/990, 851, filed December 15, 1997, to Changwdn Kang, Young- 
5 Soo Kwon, Young Tae Kim, Hubert Koster, Daniel Little, Maryanne 
O'Donnell, Guobing Xiang, Charles Cantor and David Lough, entitled 
"MASS SPECTROMETRIC METHODS FOR SEQUENCING NUCLEIC 
ACIDS". Where permitted, the subject matter of this application is 
incorporated herein by reference In Its entirety* 

10 The subject matter of this application is related tb the following 

copending applications, international applications and patents: U.S. 
Patent Nos. 5,605,798, 5,622,824 and 5,547,835, 5,691,141, 
Intematiohat PCT application No. WO 98/20019, WO 98/20166 and 
WO 97/42348. Where permitted, the subject matter of each of these 

15 applications and patents is incorporated herein in its entirety. 
BACKGROUND OF THE INVENTION 

Nucleic acid sequencing methods are important and powerful tools 
in the molecular biologist's repository of techniques for assessing and 
understanding gene expression and regulation. Methods for sequencing 

20 PNA molecules include chemical degradatiori sequencing (Maxam et aL 
(1977) Proc. Natl. Acad. Sci. USA 74: 560, see, also, Ambrose et aL 
(1987) Methods Ehz. 152: 522) arid chain termination sequencing 
(Sanger et aL (1977) Proc. NatL Acad. Sci. USA 74: 5463). The 
Maxam-Gilbert sequencing method is a degradative method that relies on 

25 specific cleavage of DNA fragments. A fragment of DNA, which is 
radiolabeled at one end/ is partially cleaved in five separate chemical 
reactions. Each reaction is specific for a type of base or a base so that 
five populations of labeled fragments of differing lengths are generated. 
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The populations of fragments are resolved by polyacrylarnide gei 
electrophoresis (PAGE). 

The chain termination or Sanger method relies on a DNA synthesis. 
1 ^ Slngle:stranded DNA is used as a template and labeled primers are used 
5 to iriit|ate formatipn of complementary strands. The synthesiis reaction is 

run in the presenpe of the four depxynucieptides (dNtPs), dATP, dTTP, 
; dCTp and dGTP and one of the four didepxynucleotldes (ddNTPs), 
. ddATP. ddTTP, ddCTP, and ddGTP. The complementary strands ar^ 
prematurely terminated along the chain by addition of dideoxynucleotides 
1.0 to the grpvying chains. By starting with four reaction mixtures, which 
each contain one of the four ddNTPs. sets of strands terminating in A. C. 
G and T are generated. Each reaction mixture is electrophoresed in one 
lane of a polyacrylamitie gel that resolves fragments that differ by a 
single base in. length. 
15 Tbese methods are primarily designed for sequencing single- 

stranded DNA, which is produced by denaturing the DNA and separating 
the single strands, or by other methods, such as cloning into single- 
stranded phage vectors. These sequencing methods rely on reactions 
that produce an array of fragments that differ in length by a single base 
20. and terminate in an identifiable base. The fragments are resolved by size 
^ using PAGE and are detected using a label, such as a radioisotope, 
Because the resolution of bands on an electrophoretic gel decreases 
exppnentially as the length of the DNA fragments increase, these 
methods only permit (DNA fragments of up to about 300 to 400 
25 nucleotides to be sequenced. 

. The target DNA can also be amplified using the polymerase chain 
reaction (PCR) (see, e^. Mullis gigL (1986) Cold Sorinn H«rhnr Sy mp, 
Quant, Biol. 263; U.S. Patent No. 4,683,202 to Mullis et aL). which 
results in an amplified concentration of the duplex target DNA. A 
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number of methods have been used to generate single-stranded 
templates directly from PCR for subsequent sequencing. For example, 
radio-labeled primer that is specific for only one strand may be used. 
Alternatively, PCR may be run under conditions such that one primer is at 
5. limited conceritration. Once the primer that is at the lirhited 

concentration is exhausted, the second strand is amplified at a linear rate 
through succeeding cycles (Gyllenstein et aK (1 988) Proc. Natl. Acad. 
ScL 85: 7652; Mihilovic et aL (1989) BioTechnioiies 7 X0.) 

There are numerous sequencing strategies in use. The selected 
10 strategy depends upon the purpose for which the DNA Is sequenced and 
the amount of information available about the DNA prior to sequencing. 
For example, if the target DNA is sequenced in order to confirm that a 
particular mutation has been introduced into the DNA, it may only be 
necessary to sequence a small region of DNA. If the DNA fragment is an 
1 5 unknpyvn g ene or portion of a gene for which a sequence must be 

accurately determined, then it may be necessary to sequence the entire 
fragment. Because the sizes of fragments that can be sequenced are 
limited to about 400 bases, DNA fragments longer than this size must be 
cleaved. Cleavage may be random, by subcloning segments of the target 
20 DNA. The subcloned fragments, which include overlapping fragments, 
are then sequenced, and ordered using a computer program (see, 
Statlen (.1986) Nucl. Acids Res 1^.9171 Alternatively, the DNA may be 
systematically subcloned by generating and sequencing overlapping or 
nested mutants or by other ordered approaches. 
25 The Sanger chain termination method and other sequencing 

methods rely on the use of single-stranded template by cloning the target 
DNA into single-stranded phage vectors. The use of plasmids as vectors 
for the target DNA, however, is preferred over the us of phage DNA for 
reasons, which include the variety of available plasmids, the ease with 
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Which plasmids are manipulated, and the greater stability of inserted DMA 
in plasmids compared to phage Vectors. Consequentiy, methods for 
sequencing in which the target DNA Is cidried Into plasmid DNA, rather 
than Into a single-stranded phage DNA (see, e^, Wallance et aL d 981) 
5 fiSDSlg: 21-26; Quo fit aL (1982) NucK Anids Rp< . 2065-2084; 
Vieira §JaL 11982) fiane M: 259-268) have been developed. 

These methods are designed to only sequence one strand of the 
target DNA at a time (see, fi^ Chen ej aL (1985) DNA 4: 1 65-1 70- 

Hattori et aL (1986) Anfl. Piochftm 15?: 232-238; Mierendorf et aL 
10 (1987) Methods En/ymoL 152: 556; Mehra gj ah (1 986) Proe. Natl. 
Acad. Sci. USA 83: 701 3-701 7). Use of double-stranded DNA. 
however, avoids the subdoning or isolation of single-stranded DNA 
fragments, vyhich are used forthe dideoxy chain temiinator sequencing 
reactions. The use of double-stranded DNA, however, had been limited 
15 because of the poor template quality 6f denied duplex DNA. As a 
result, these methods had not provided as accurate sequence data as 
provided by methods in which the DNA is cloned into a single-stranded 



vector. 



Recently, the problems associated with template quality have been 
20 solved by the development of methods that use plasmids that Include 
sites, adjacent to the complementary strands of the Inserted DNA, to 
. which strand-specific primers may be efficiently hybrldteed. By virtue of 
, ; these methods each of the single- strands of doubie-stranded DNA can be 

sequenced directly from plasmid DNA without prior subclbhing into 
25 phage vectors (see, e^, Chen ej at {(1985) DNA 4: 165-170 and Chi si 
aL 11988) Nucl. Acids Rft^ ig: io382). A strand specific synthetic 
pnmer is annealed to covalently closed cirbular DNA, which has been 
denatured by heat or alkali, before proceeding with dideoxy sequencing 
reactions. AJternatively, the primer can be annealed to open circle 
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double-strandec! plasmid DNA, which has been denatured by alkali, as a 
template (see, Hattorl fij sL ((1986) Anal. Biochem. 152 : 232-238). The 
double-stranded DNA is denatured with alkali or heat prior to sequencing 
using the Sanger method; which is performed at 37" C or higher. The 
5 use of different "forward" and "reverse" primers, which are feach 
complementary to the lac Z sequences adjacent to the EfifiRI site in 
>4gt11, for separately sequencing each strar)d of DNA that has been 
cloned into /Igtll has also been described (see, Mehra e$ aK (1986) Proc. 
Natl. Acad. Sci: USA 83: 7013-7017). 

P P'asnfi'ds with oppositely oriented promoter regions, which are 

used in methods which involve transcription, are also used as vehicles 
for target DNA which is to be sequenced. Each promoter region serves 
as a distinct specific priming site for sequencing the iriserted DNA. Such 
plasmids are, commercially available. For example, the twin promoter 

5 plasmid pQEiyi™ contains the bacteriophage SP6 and T7 RNA polymerase 
prompters, in opppsrte.prientations (Mierendorf siaL (1987) Meth. 
InzSOnaLlSg: 556-562) which cari gerierate transcripts for sequencing. 

Most methods for sequencing nucleic acids require the use of 
pplyacrylamide gel electrophoresis (U„ PAGE) that can result in 

> sequencing artifacts or, require detectable labels, siich as radioisotopes, 
enzyrnes, pr fluorescent or chemiluminiBscent moieties. Methods for 
sequencing of DNA using mass spectrbmeric detection are also in use 
(see. U.S. Patent No. 5,547,835). These methods primarily rely on 
sequencing of DNA. 

i Using DN^: sequencing methodologies, the entire sequence of the 

human genome ultirriately will be determined. The knowledge of the 
complete sequence of the human genome DNA will help to understand, 
to diagnose, to prevent and to treat human diseases. To be able to 
tackle successfully the determination of the approximately 3 billion base 
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pairs of the human genome In a reasonable time frame and In an 
economical way, rapid, reliable, sensitive and inexpensive methods are 
ri^qulred. 

'* ^" ^^^^ to provide additional methods for 
5 sequencing. In particular, it is an pbiect herein to provide mass 
spectrometric methods of sequencing nucleic acids using RNA 
polymerase. It Is a further object herein to provide methods of 
sequencing nucleic acids in an array format using RNA polymerase in 
which nucleic acid probes are immbblliked to supports at high densities 
10 to facilitate mass spectrometric detection. It is also an object herein to 
provide methods for Identifying transcriptional terminator sequences 
usirig mass spectrometric methods. 
SUMMARY 0F;THE INVENTION 

Methods for sequencing nucleic acids, particuarly DNA. usi^^ 
15 transcrlpts,are provided. In particular, methods for sequencing nucleic 

acids using RNA polymerases, including DNA-dependent and RNA- 
dependent RNA polymerases, are provided. RNA transcripts are analyzed 
by mass spectrometry. 

In practicing the methods, a DNA riiolecule compri&Ing a promoter 
. 20 and target nucleic acid sequence operatively linked to the promoter is 
transcribed with an RNA polymerase that recognizes the promoter under 
conditions that generate a nested set of RNA transcripts that include the 
target is produced. The molecular weight of the transcripts are 
detemiined by mass spectrometry to thereby determine the nucleic acid 
25 sequence of the target sequence. The DNA molecule that contains the 
nucleic acid and target is preferably immobilized. 

In certain embodiments, the methods use a double-stranded 
promoter-containing fragment with a single-stranded region at one end. 
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vvhich captures a molecule that contains the target DNA. In these 
embodiments, the target.DNA is preferably single-stranded. 

In preferred embodiments, the promoter-^containing nucleic acid is 
, covalently coupled via the 3'-end of the noncoding (antlsense) strand or 
5 ,5'-end ,of the coding strand to a solid support. Coupling can be by any 
method known to those of skill iri the art, such as via a thiol linkage. For 
example, 5'- or.S^thiolated DNA is prepared and linked at to a 
amlnqsilane^treated solid support. The linkage may be direct or via a 
linker group. The resulting immobilized molecules are preferably arranged 
10 , in an array format. 

A double-stranded nucleic acid molecule encoding a promoter 
sequence cap obtained from any source, such as bacteria, viruses, 
bacteriophages, plants and eukaryotie organisms, or assembled by 
synthetic protocols. It is engineered using so that the resulting fragment 
15 contains a single-stranded region of at least a 5 nucleotides at the 3'-end 
. of the coding (sense) strand. This single-stranded region is designed 
such that it is complementary to a region tif the nucleic acid to be 
sequenced or to a common overlapping sequence, such as a restriction 
endonuclease site. 

^9 T^e target nucleic acid, which inlcudes the region of nucleic acid 

to be sequenced is single-stranded or double-stranded but includes a 
. jingle-stranded 3'-end, is hybridized to the complementary sequences of 
the prompter^containing DNA. Hybridization Is perfomied either before or 
after immobilization. The hybridization of the two nucleic acid 

25 molecules introduces one or more "nicks" in the hybrid at the iunction(s) 
Of the adjacent nucleic acid molecules. In certain embodiments, nicks in 
. the coding or non-coding strand, preferably the coding strand, are ligated 
by the addition of an appropriate nucleic acid ligase prior to initiating 
transcription. 
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The methods preferably employ any suitable strategy for 
generating nested RNA transcripts. The preferred methods rely on a 
modified Sanger sequencing strategy in which RNA polymerase is used 
to generate a set of nested RNA transcripts obtiained by base-specif id 
5. qhain terrnination. These are analyzed by mass spectrometry. A 
modified Gilbert-Maxam strategy is also contemplated. In this method, 
the transcripts are cleaved with base-specific RNases to pi-oduce base- 
specifically terminated fragments and the molecular weights are then 
assess by mass spectrometry. 
10 Transcription is initiated from the prorfioter by the addition of the 

appropriate RNA polymerase in the pres^ce of ribonucleoside 
triphosphates and a selected base-ispecific chain terminating 3'- 
depxyribdriucleoside triphosphate. In preferred embodiments, the 
transcription mixture also contains modified ribonucleoside triphosphates 
15 and ribonucleotide analogs ribonucleotide triphosphate analogs that 
reduce secondary structure of the resulting RNA transcripts and/or 
increase the fidelity of termination and turnover of the RNA polymerase 
enzyme thereby Increasing the amount of RNA transcript available for 
analysis.. These analogs include, but are not limited to: inosine 5'- 
20 triphosphate (ITP), which reduces secondary structure. 4-thio uridine 
5'triphosphate (UTP) and 5-bromo UTP or 5'-iodb CTP, which increase 
the fidelity of termination and turnover of the RNA polymerase enzyme 
thereby increasing the amount of RNA transcript available for analgia, 
the resulting RNA transcripts are analyzed by mass spectrometry. 
25 In other embodiments, the sample further contains a matrix 

material for mass spectrometry for analysis by matrix-assisted laser 
desorptjon/ionization mass spectrometry fMALDI) and preferably further 
uses time-of-flight (TOP) analysis. The sequence of the nucleic acid is 
obtained by aligning the obs rved mass of the chain-terminated RNA 
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transcripts obtained from sequencing reactions containing each of tlie 
four chain-terminating bases. 

The methods of sequencing provided herein may be used for 
diagnostic applications. Diagnostic applications; include methods for 
; , 5 detern;iination of the presence of genetic alteratioris in a known target 
nucleic acid. For example, a region of the target nucleic acid Is amplified 
and the. nucleic acid strand corresponding to the noncodlng strand is 
isolated. The nucleic acid probe containing the promoter may be isolated 
from a natural source or assembled synthetically by hybridizing two 
10 complementary oligonucleotides to form a promoter sequence. A sin- 
gle-stranded region of at least 5 nucleotides that is complementary to a 
region of the nucleic acid to be sequenced or to a common sequence is 
introduced by recombinant means at the 3'-end of the coding strand. In 
^ preferred embodiments, the promoter-containing nucleic acid is 
15 covalently coupled via the 3'-end of the noncodirig strand or 5'-end of 
the coding strand to a sol id -support and, more preferably; is a 5'- or 3'- 
thiplated DNA linked at high densities to a aminosilane-treated solid 
support. The linkage may be in the absence or presence of a linker group 

20 , A. singl^stranded, ?' overhang of the nucleic acid to be sequenced, 
in single-stranded or double-stranded form, is hybridized to the 
cornplerpentary sequwes. of the noncoding strand and, ^i^ 
embodiments, the nlck(s) between one or more nucleic acid strands 
is/are ligated prior to .transcription. Transcription is initiated using the 

25 appropriate RNA polymerase in the presence of ribonuclebside 
triphosphates and a selected base-specific chain terminating 3'- 
debxyribonucleoside triphosphate. In preferred embodiments, the 
transcription mixture may, also contain inosine 5'-triphosphate and/or one 
or more modified ribohucleoside triphosphates to facilitate analysis of the 



SUBSTITUTE SHEET (RULE 26) 



wo 99/31278 



PCt/OS98«67W 



-10- 

RNA transcripts. The RNA transcripfs are analyzed by mass 
spectrometry, preferably using MALDI^TOF. 

When used In array formats, a panel of promoter-containing 
nucleic acid probes can be constructed and linked Ir^ an irray such that 
6 the target nucleic acid is jiermuted'alohg'the eritire sei|uence, aa^, the 
coding sequence of a gene, allowing for the detefhilnatidn of the nucleic 
acid sequence of the entire gene during a single reaction sequence. 

Methods of identifying transcnptional tei-minator arid attenuator 
sequences are also provided. By modifying transcription conditions to 
10 produce full-length transcripts, transcriptional terminator sequences, such 
,as rho-dependent and rhonndepehdent termihatbrs, rhay be identified 
using mass spectrometric methods; In practicing the methods, a single- 
stranded region of the 3%end of the target nucleic acid to be sequenced 
is hybridized to a complementary sequence at the 3'-end of the coding 
1 5 strand a promoter-containing nucleic acid probe. In preferred 

embodiments, the promoter.containing nucleic acid is cbvalently coupled 
via the 5'-end of the noncoding strand or 3'-end of the boding strand to a 
solid support and, more preferably, is a 5'- or S'-thiolated DNA linked at 
high densities to a aminosilane-treated dolid support. The linkage may be 
20 in the absence or presence of a linker group ar^d is preferably arranged in 
an array format. : 

Transcription is initiated in the absence or presence of modified 
RNA triphosphate analogs that in(ire&se the efficiency of RNA polymerase 
termination at such terminator sequences, such a¥ 4-thio UtP, 5-bromo 
25 UTP or 5'-iodo CTP. In certain embodiments, hlck^ In ohb or more strand 
may be llgated by the addition of ah appropriate nucleic abid ligase prior 
to initiating transcription (Le,, adding a DMA or RNIa ligase). The mass of 
the terminated RNA transcripts is determined by mass spectrometry. The 
observed mass is indicative of the location of the terminator^ependent 
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^arrest 6f transcription and by comparing the altgnnnent of the sequence 
immediately prjBceding the site of transcriptional termination from distinct 
genomic locations, terminator and attenuator sequences may be 
identified for different RN A polymerases. 
^5 DESCRIPTION GF THE DRAWINGS 

Figure 1 illustrates a system for preparing arrays of a sample 
. :material for analysis. 

Figure 2 illustrates a pin assembly suitable for use with the system 
depicted in Fig. 1 for implementing a parallel process of dispensing 
10 material to a surface of a substrate. 

Figure 3 depicts a bottom portion of the assembly shown in Fig, 2. 
Figure 4 depicts an alternative view of the bottom portion. of Jhe 
pin assembly depicted in Fig. 2. 

, Figyres 5A-5D depict a method for preparing an array of sample 
15 material. 

Figures 6A-6B depict an alternative aissembly for dispensing 
material to the surface of a substrate. 

Figure 7 is a schematic showing covalent attachment of 
oligodeoxynucleptides to a silicon dioxide surface as described herein. In 

20 partipular, silicon dioxide was reacted with 3-amihdprbpyltriethoxysilane 
to produce a uniform layer of primary amino groups on the surface. A 
heterobifunctjonal crosslinking agent was then reacted with the primary 
amine to iricorporate an iodoacetamide-group. An oligodeoxynucleotide 
containing a 3'- or 5'rdisulfide (shown as the 5') was treated with tris-{2- 

25 carboxyethyl) phosphine (TCEP) to reduce the disulfide to a free thiol, 
which was then coupled to the iodoacetamido-surface. 
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Figure 8 Is a graph which plots conjugation of bii^odeoxynucleo- 
tidQ probes to a silicon surface as a function of TCEP concentration used 
in the disulfide reduction. 

Figure 9 is a matrix assisted laser desorbtion/ibnizatjon-time-of- 
5 flight (MALDI-TOF) mass spectrum of a silicoh Wafer w^^ the 

pligodeoxynucleotide sequence denoted •'TCUC- (5'-GAATTCGAGCTCG 
GTACCCGG-3'; SEQ ID NO: 1) covalentlY bound essehtially as described 
in Figure 7 and the oligodeoxynucleotide sequence denoted "MJMB" (5'- 
CCGGGTAeCGAGCTGGAATTC-3'; SEQ ID NO: 2) hybridized thereto. 
10 Figure 1 0 depicts one embodiment of a subSltrate having wells 

etched therein that are suitable for receiving material for analysis. 

Figure 1 1 depicts one example of spectra obtained from a linear 
time of flight mass spectrometer instrument and representative of the 
material composition of the sample material brt the surface of the 
15 substrate depicted in Fig. 10. 

Figure 12 depicts molecular weightii determined for the sample 
material having spectra identified in Fig. 1 1 . 

Figure 1 3 is a schematic of a 4 x 4 (1 6-iocati6n) DNA array on the 

surface of a silicon wafer with the thiol-coritaining oligonucleotide 

20 molecules denoted "Oligomer 1 (S'-CTGGATGGGTCggATCATCTIT^ 
SEQ ID NO: 5), Oligomer 2 (5HS)-CCTGTTbGGAACT^^ 
ID IVIQ: 6) €|nd ^Oligomer 3" (SEQ ID NO: 1; a free thiol derh^ative 
"TCUC* oligonucleotide of EXAMPLE 1 ) covalehtiy bound to 1 6 locations 
on the surface of the silicon wafer essentially as described in 

25 EXAMPLE 2. , 

Figure 1 4 is a schematic of the hybridization of specific 
oligonucleotides to each of the 1 6 locations of the DNA hybridization 
array of Figure 13 with the Oligomer 1 complementary oligonucleotide 
(5'-GATGATCCGACGCATCAGAATGT-3'; SEQ ID NO: 7) bound to 
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Oligomer 1 , the Oligomer 2 complementary oligonucleotide (5'- 
AATACTACACAG-3'; SEQ ID NO: 8) bound to Oligomer 2 and the 
Oligomer 3 complementary oligonucleotide (5'- 

CCGGGTACCGAGCTCGAATTC-3'; SEQ ID NO: 2) bound to Oligomer 3. 
5 Figure 15 is a representative MALDI-TOF mass spectrum of a 4 x 4 

(16:location) DNA array on a silicon wafer shown schematically in Figure 
15. The spectrum reveals a single, predominant signal of an 
experimental mass-to-charge ratio in each location corresponding to the 
specific hybridized pligonuGleotides. The 2+ indicates the position of a 

10 doubly charged molecule used as a reference standard during MALDI-TOF 
MS analysis. The * denotes residual amounts of contaminating 
oligonucleotide that remain on the surface of the chip following washing 
procedures. The relative position of the * signal reveals the approximate 
size of the contaminating oligonucleotide. 

1 5 Figure 1 6 is a representative MALDI-TOF mass spectrum of an 8 x 

8 (64-location) DNA array.. The spectrum reveals a single, predominant 
signal of an;experimental mass-to-rcharge ratio corresponding to the 
predicted specific hybridized pligonucieotides. The * ddnotes residual 
amounts of contaminating oligonucleotide that remain on the surface of 

20 the wafer following washing rprocedures. The relative position of the * 
signal reveals the approximate size of the contaminating oligonucleotide. 

Figure 17 shows the nucleotide sequence of a DNA molecule (SEQ 
ID No: 10) assembled by hybridizing a 5 5-mer oligonucleotide to a 
complementary 25-mer oligonuicleotide (SEQ ID No: 1 1 ) and a 

25 complementary 30-mer oligonucleotide (SEQ ID No: 12). The resulting 
double-stranded DNA encodes e SP6 promoter (nt 1 -1 8 of SEQ ID No: 
10) and has a single nick in the coding strand of the molecule at nt +7 
relative to the start of transcription from the SP6 promoter. The position 
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of the nick and the start of transcription initiation from the SP6 promoter 
are indicated. 

DETAILED DESCRIPnON AND PREFERRED EMBODIMENTS 
Definitions 

5 Unless defined otherwise/ all technical and scientific terrns used 

herein have the same meaning as is commonly understood by one of skill 
in the art to which this invention belongs. All patents and publications 
referred to herein are Incorporated by reference herein. 

As used herein, the term "nucleic acid'' refers to oligonucleotides 
10 or polynucleotides such as deoxyribonucleic acid (DNA) and ribonucleic 
acid (RISIA» as well as analogs of either RNA or DNA, for example, made 
from nucleotide analogs, any of which are in single or double-stranded 
form and also PNA. Nucleic acid molecules can be synthetic or can be 
isolated from a particular biological sample using any number of 
15 procedures which are well-known in the art, the particular procedure 
chosen being appropriate for the particluar biological sample. 

As used herein, nucleotides include nucleoside mono-, di-, and 
triphosphates. Nucleotides also include modified nucleotides such as 

phosphorothioate nucleotides and deazapurine fiucleotides. A complete 
20 set of chain-elongating nucleotides refers to four different nucleotides 

that can hybridize to each of the four different base^ comprising t^^^ 
, tenriplate. 

As used herein, a nucleic acid prdmoter-contairiihg probe refers to 
a nucleic acid fragment that includes a double-stranded region encoding a 
?5 prompter and a single-stranded region that contains at l^ast S.nucleotldes 
at the 3'-end of the codirtg (sense) strand relative to the promoter. The 
at least 5 nucleotides are selected to be complementary to a single- 
stranded region at the S'-end of a nuci Ic acid coritaining the target 
nucleic acid of Interest. Reference to the 3'-end refers to the antisense 
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strand if the target is contained in a double-stranded molecule. When the 
target Is double-stranded the resulting molecule has nicks in the coding 
and non-coding strands. 

. Alternatively, the single-stranded region of the nucleic acid 
B promoter-containing probe includes at least about 5 nucleotides at the 5'- 
end of the noncpding (antisense) strand (relative to the promoter) that 
are complementary to at leastiabout five nucleotides at the 5'-end of the 
coding strand of a target nucleic acid. In this erribodiment, the target 
must be double-stranded. Following hybridization, the resulting 
10 molecule has nicks in the coding and non-coding strands. 

In all embodiments, nicks can be ligated prior to transcription. As 
shown herein, however, if the nicks are at a sufffcleht distance, 
preferably at least about +7 nucleotides, it is not necessary to ligate. 
As used here in, the target nucleic acid is the niidleic acid 
15 molecule to be sequenced. It can be part of a larger moleucle. The 
molecule that contains the target nucleic acid can contain or can be 
modified to contain at least about 5 nucleotides whose sequence is 
designed to be fully complementary to at least 5 contiguous nucleotides 
at ttie 3' end of the over hang on the promoter-containing capture probe 
20 (or 5' end depending upon the configuratioh selected). The target 
nucleic acid can be double-stranded, with a 3* or 5' single-stranded 
overhang , or single-stranded. RNA polymerases can and do transcribe 
slngle-stranded molecules, particularly, once the ternary complex is 
forrned. The target may. also be RNA or PNA (protein nucleic acid formed 
25 by conjugating bases to an amino acid backbone; see, e^, Nielsen et 
aL (1991) Science 254 :1497). 

As used herein, nucleic acid synthesis refers to any process by 
which oligonucleotides or polynucleotides are generated, including, but 
not limited to processes involving chemical or enzymatic reactions. 
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As used herein, a bas&'Specifically terminated ribonucleotides is 
one that generated during transcription by Iricbrporation a nucleotide that 
results in transcription termination. Base-speclficaNy ternilnating 
: 5'***""'?'®***'^^ ^P'^OP*^^^^' which produce base-specificaily terminated 
5 ribonucleotides, are known to those of skill in the art. Examples of base- 
specifically terminating ribohucjeoside trlphosphalies include, but are not 
limited to: :3'-deoxyribonucleosidetriphosphatei5, such as 3'-d6TP, and 
Others described herein and known to those Of skill In the art. 

As used herein, complementary when referring to two nucleotide 
10 sequences, means that the two sequences of nucleotides are capable of 

hybridizing, preferably with less than 25%. more preferably with less 
, . than 16%, even more preferably with less than 5%, most preferably with 
no mismatches between opposed nucleotides. Preferably the two 
molecules Will hybridize under conditions of high stringency. 
15 As used herein: stringency of hybridization in determining 

percentage mismatch is as follows: 
. 1) high stringency: O.T x SSPE. 0.1 % SDS, ee^C 
. 2) medium stringency: 0.2 x SSPE, 0.1 % SDS, SO^C 
3) low stringency: 1.0 x SSPE, 0.1% SDS, 50«»C 
20 It is understood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures; 

As used herein, the term "array" refers to an ordered arrangement 
of members pr ppsitiohs. The array may contain any numlier of members 
or positions and, can be In any variety of shapes. In preferred 
25. ernhodiments, the array is two-dimensional and cont9ins n x m members, 
wherein m and n are integers that can be the same or different. In 
particularly preferred embodiments, n and m are each 4 or a multiple 
thereof. . 
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The term "cross-linking agent" is art^recoghlzed, and, as used 
herein, refers to reagents taht immobilize a nucleic acid to an insoluble- 
support, preferably through cdvalent bonds. Thus, appropriate "cross- 
linking agents" for use herein includes a variety of agents that are 
5, capable of reacting with a functional group present bh a surface of the 
, : insoluble support and with a functional group present in the nucleic acid 
molecule. Reagents capable of such reactivity include homo- and hetero- 
bifunctional reagents; rnany of which are known in the art. 
Heterobifunctionai reagents are preferred. 

10 As used herein, the term "thiol-reactive functionality," refers to a 

^functionality that reacts rapidly or preferably with a nucleophilic thiol 

. moiety to produce^ a covalent bond, such as a disulfide or thioether bond. 
In general, thiol groups are good nuclebphiles, arid preferred thiol-reactive 
functionalities are reactive electrophiles. A variety of thiol-reactive 

15 fuhctibnalities are known in the art; and include, for example, haloacetyls 
(preferably iodoacetyl), diazoketones, epoxy ketones, a, B-unsaturated 
carbonyls (such as, a, B-enohes) and other reactive Michael acceptors 
. , (including maleimide), acid halides, benzyl halides, and the like. In 
certain embpidiments; a free thiol group of a disulfide cah' react with a 

2Q; free thiol group; such as by disulfide bond formation, including by 
. disulfide exchange. A "thiolrreactive'^ dross-linking agent, as used 
herein, refers to a cross-linking reagent (or surface) which includes, or 
can be rnpdified to include^ at least one thiol-reactive functionality. 
Reaction of a thiol group can be temporarily prevented by blocking with 

25 an appropriate protecting group, as is conventionial in the art (see e.g,. 
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T.W. Greene and P.G.M. Wuts "Protective Groups in Organic Synthesis, ' 
2nd ed, John Wiley & Sons, (1991)). 

As used herein, a selectively cleavabfe linker is a linker that is 
cleaved under selected conditions, such as a photpcleavable linker, a 
5 chenriically cieavable linker and an enzymatically cleavable linker (Is^ a 
restriction endonuclease site or a ribonucleotide/RNase digestion). The 
linker is interposed between the support arid immobilized DNA. 

As used herein, the tirms "protein", "polypeptide" and "peptide" 
are used interchangeably when referring to a translated nucleic acid, 
10 such as a gene product. 

As used herein, •'sample" refers to a composition containing a 
material to be detected. In a preferred embodiment, the sample is a 
"biological sample", which refers to any material obtained from a living 
source, including but not limited to animal, particularly mammals, 

15 including humans, plant, bacteria, fungi, protlst and virus. The biological 
sample can be in any form, including solid materials, such as tissue, cells 
and biopsied material, and biological fluids, including urine, blood, saliva, 
amniotic fluid, cerebral spinal fluid and mouth wash (containing buccal 
cells). Preferably solid materials are mixed with a fluid; 

20 As used herein, "substrate" or "solid support" sH^ 

insoluble support onto which^a sample is deposited adboraing to the 
- materials as described herein. Examples of appropriate substrates 
include beads, including but not limitbd to silica gel, cohtrblled pore 
glass, magnetic, dextran, agarose and cellulose, capillaribs, flat supports 

25 such as glass fiber filters, glass surfaces, metal surfaces (steel, gold. 
Silver, aluminum, copper and silicon), plastic materials including multiwell 
plates or membranes {e^ of polyethylene, polypropylene, polyamide, 
polyvinylldenedifluoride), pins , such as arrays of pins suitable for 
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combinatorlal synthesis or analysis or beads in pits of flat surfaces such 
as wafers, pariticularly silicon wafers, with or without plates. 

As used herein; RNA polymerase refers to DNA-dependent RNA 
polymerases and RNA-dependent RNA polymerases. Any RNA 
5 polymerase that recognizes a specific promoter sequence and is capable 
of initiating transcription and elongating a RNA transcript is contemplated 
withia the scope of the term herein. Exemplary RNA polymerases that 
may be used in the methods provided herein include, but are hot limited 
to those obtained from: 1) archeabacteria, such as Halobacterium. 
^0 Methanobacterium. Methanococcus. Sulfolobales and Thermoolasma : 2) 
eubacteria, such as gram negative bacteria, e.g.; Escherichia coli and 
strains of Salmonella and Shigella, gram positivie bacteria; e.g.. Bacillus 
subtilis and Staphlococcus aureus : 3) bacteriophages, such as T7, T3, 
SP6, SP6 nicked and N4; 4) DNA viruses; 5) RNA viruses; such as 
15 influenza virus; 6) plants, such as wheat; and 7) eukaryotic RNA 

polymerase II Isolated from fungii, e.g., Saccharorriy ces cerevisae and 
higher eukaryotic organisms, e^, mammals. Also included within the 
scope of th? term. RNA polymerase as used herein is the RNA phage 
replicase (see, e^fl^, International PCT application Nt):PCT/US87/00880 
20 and 5,270,184). . Eukaryotic RNA polymerases, suc^ as those isolated 
from eukai^otjc tissues, including plants and aniifnals; that recognize 
eukaryotic prompters, including viral promoters from viruses that infect 
eukaryotic cells, are also contemplated herein. The T7, T3, and SP6 
RNA polymerases have been cloned and their sequences and methods for 
25 their purification as well as for purification of the bacterial RNA 

polymerases and certain bukaryotic DNA dependent RNA polymerases 
are known. In addition, nurnerous RNA polymerase commercially 
available in large quantities, and can also be readily produced from 
commerically available vectors. The sequences of the promoters for each 
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of the above-noted polymerases and other RNA pdiyindrase are also 
known to those of skill in the art« 

As used herein,, a promoter region refers to the portion of DNA of 
£ a gene that controls expression of DNA to which it is operatively linked. 
5 The promoter region includes specific sequences- of DNA that are 
: sufficient for RNA polymerase recognition, binding and transcription 

initiation. This portion of the promoter region is referre^^ 
. . promoter. In addition, the prorhoter region includes sequences that 
modulate this recognitioni binding and traniscription initiation activity of 
TO the, RNA pdiymerase, These sequences may be cis acting or may be 
responsive to trans Promoters, depending upon the 

nature of the regulation, may be constitutive or regulated. A constitutive 
promoter is always turned OH. A regulatable promoter requires specific 
signals to be turned on or off . A developmental ly regulated promoter is 
15 one that is turned on or off as a function of development. 

The promoter can be of a; consensus sequence or variant. When a non- 
wild-type promoter is used, transcription will occur at a rate sufficient to 
produce a detectable transcript, and is typically at least about 5-10% of 
the rate at which transcription would have occurred if a wild-type or 
20 native promoter had been used by the RNA polymerase to transcribe the 
nucleic acid jn vitro. Each species of RNA pblymSrase is specific 
for a particular promoter sequance. Each of the three bacteriophage 
RNA polymerases (T7, T3^ and SP6) has its own specific DNA promoter 
sequence. After the RNA polyrtierase has attached to the prompter 
25 sequenca of the template and treinscriptlon has b^en Initiated, the 

promoter sequance releases the RNA polymerase. Th^ RNA polymerase 
then continues transcription along the terhplate until (1) It runs out of 
RNA starting materials, (2) the transcription reaction is terminated by a 
^ particular nucleotide, nucleoside or analog. (3) transcription is 
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spontaneoiisiy aborted when the RNA polymerase and the template 
prematurely disassociate or (4) the end of the DNA template is reached. 
As used herein; a "promoter-containing nucleic acid" is a nucleic 
, acid that contains a sequence of hucieptides thiat directs the site-specific 
5 binding of an RNA polymerase molecule ; which binds and melts the 
double-stranded DNA to form an open transcription initiation complex 
that is capable of initiating RNA synthesis iri the presence of 
ribonucleotide triphosphates. Thus, a functibnal promoter sequence is a 
sequence of nucleotides to which a DNA dependent RNA polymerase 
10 binds, melts in, forms an initiation complex and initiates transcription. 
The promoter-containinjg nucleic acid capture probe refers to the double- 
stranded molecule the includes the'promotei' region iand a 3' overhang of 
at least about 5 nucleotides to which the target hucleic acid hybridizes, 
particularly under high stmgency conditions, Typically such hybridization 
15 requires about 5 contiguous base pairs of identity between the target and 
promoter-containing capture probe. After initiating transcription, the 
RNA polymerase (or subiinrts thereof) is part of ternary complex 
containing the polymerase, the DNA, and the nascent RNA transcript. 

A? used herein, a "coding strand •^ referis to the nucleic acid strand 
20 of a promotei^-containing nucleic acid that has the saMe polarity as a 
corresponding rtiRN A mplecule initiated f rorh that prbrribter. Thus, it 
refers to the sense strand. The nbh^codihg strand is the anti-sense 
:strand that is transcribed by RKI A pdlyme^^^^^^ 

. . As used herein "native nucleoside (or riboniicleoside) triphosphate" 
25 or "native nucleotide" or ribonuclebtidis triphosphate means any one of 
the naturally occurring normal precursbrs of RNA, i .e., ATP, GTP, CTP, or 
UTP. A "nucleoside triphosphate analog" or"hucleotid analog" means 
any nucleoside triphosphat analogous to a native nucleotide, but that 
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contains one or more chemical modifications compared to the native 
nucleoside or nucleotide. > 

As used herein, a " matrix materiar refers to a material used in 
mass spectroriietry that is a proton donatlngi UV absorbing material, 
$ usually an organic acid, that forms crystalline matrix^nucleic acid 

structures that are readily ionizable during MALDI.: An exemplary matrix 
niaterial is a solution of 3-hydroxypicolinic acid (3-HPA, 0.7 M in ,50% 
acetonitrile, 10% ammonium citrate); 

As used herein, "conditioning" refers to a process whereby prior to 
''P rfnass spectrpmetriG analysis, the nucleic acid to be analysed is modified, 
for example to deprease.th^ laser energy required for volatization and/or 
to minlmfee fragmentationv C performed on a nucleic 

. acid, preferably aft#r immobilization. An example of conditioning is 
rnodification of tha phosphodiester backbone of the nucleic acid molecule 
15 by, for example, cation exchange), eliminates peak broadening due to a 
heterogeneity in the cations bound per nucleotide unit. Contacting a 
, nucleic acid molecule with an alkylating agent such as alkyliodide, 

iodoacetamide, -iodoethanol, or 2,3-epoxy-1-propanol. the monothio 
phosphodiester bonds of a nucleic acid molecule can be transformed into 

20 a phpsphotriester bond. Likewise, phosphodiester bonds may be 
transforrned to uncharged derivatives employing trialkyfeilyl chlorides. 
Conditioning also can be performed by incorporating modified nucleotides 
into a nucleic acid during synthesis. Such modified nucleotides can be 
used to reduce sensitivity for depurinatlon (fragmentation during MS), 

25 , sugg as with a purine analog such as M6-, N7^ or NS-deazapurine 
, nucieotides or ribonucleotide, Modified nucleotides also can include RNA 
_ building blocks, oligonucleotide triesters, phosphorothioate functions 
which ar alkylated, or oligonucleotide mimetics such as PNA. In 
preparing condition d RNA transcripts with modified nucleotides, the 
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RNA polymerase chosen must be able to incorporate the modified 
nucleotide into the RNA transcript. 
Methods for sequencing 

Mass spectrpmetric methods of sequencing nucleic acids are 
5 provided, in particular, the sequencing metliods use nucleic acid 
promoter-containing probe that contain a double-stranded region 
encoding a promoter and a single-stranded region for hybridizing to a 
complementarY region on target nucleic acids. The nucleic acid 
promoter-containing capture pr-obe is preferably immobized on a solid 
.10 support. The single-stranded region is hybridized to a compierhentary on 
a target nucleic acid, which contains the nucleic aicld to be sequenced 
and at least 5 nucleotides of single-stranded region identical to the 5 
. contiguous single-stranded nucleotides on the probe. 

Following hybridization of the target to: the nucleic acid promoter- 
15 containing probes, the nucleic acid sequence is determined by generating 
a set of nested base-specific chain terminated RNA transcripts initiated 
fi-pm the promoter and transcribed through the target hudleic acid. The 
resulting nested RNA transcripts are analyzed using mass spectrometry. 
While DNA is often the preferred vehicle for sequencing, the 
20 methods herein. are sequence RNA transcripts. RNA fragments are more 
. stable during rnatrix-asslsted laser desorptibh/loniza'tion (MALDI) mass 
§pectroi7ietry than DNA fragments; Without being bound by any theory, 
■die enhanced stability may result from the presence of the 2'-hydroxyl 
group on the sugar moiety of RNA, which helps to reduce depurination 
25 during the MALDI process. 
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RNA polymerases and promoters 

Any RNA polymerase capable of directing the ia )to transcription 
of an RNA molecule is contemplated for use In the methods described 
herein. Preferred RNA polymerases are DNA-dependent RNA 
; S polymerases. Methods for isolating and purifying RNA polymerase 

molecules are well known to those of skill in the art: 00 replicase (seee, 
U.S, Patent No. ,5,696,249, Re: 35,443 ahd Eoyang ah (1971) in 
Procedures in Nucleic Acid Resfflrrh , Gantoni arid Davies, eds.. Volume 
2, pp. 829-839, Harper and Rowe. NY); bacteria (e^, E, colj, see 
10 Burgess.and Jendrisak H 975) BiochemistQi 14:4634-4638 and Hager et 
§L n99p)fii2sj3sniistDf 29:7890-78 e^. see Sadhukhan 

g£ aL n 9m Mol. Cell. Bioohem . JH: 1 05-1 1 4; BaciHus gubtiljs, 
Giacomoni ( 1 980) Eur. J. Bioi^h^m i n«.R-7Q_qq ^ . p^^^^ 
McDonnell et at (1 977) JJVIoLBio^ 
15 (1969) \^iro!oaif 39:562-574; see also He « ah (1997) Protein.^ 
PuriL S; 1 42- 1 5 1 ); viruses (e^ rhabdovirus, see Das et ah ( 1 996) 
Methods Enzymol. 225:99-1 22; turnip mosiac virus, see Deidman ^ aL 
(1 997) J. Viral Meth. M:184-195; vaccinia, Gershon et aL (1 996) 
Methods Enrymol. 225:35-57); and mammalian (g^ yeast polll, Koleske 
20 eiaL(1996) Methods Enzymol. 223:176-184; humart polll, Maldonado 
, eJ aL (1996) Methods Enzymnl 27^:72.100). In additioriVa number of 
prokaryotic, eukaryotic, bacteriophage and viral RNA polymerases, such 
as T3, T7 and SP6 are commercially available (e^, sold by Stratagene, 
U Jolla, CA; Boehringer Mannheim, Indianapolis, IN; Pharmacia, Uppsala, 
25 Sweden; and Sigma Chemical Corp,; St. Louis. Mo) and/or well known. 
The promoters are also well known and readily available the pGEM 
series of plasmlds are available from Promega, Madison Wl; see also, 
U.S. Patent No. 4,766.072, which describes construction of the pGEM 
plasmids). 
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. In practicing the methods, DI^A-dependent RNA polymerases and 
RNA<^dependent I^NA polymerases may be used. For example, RNA 
polymerases that may be used in the methods provided herein include, 
but are not limited to those obtained from: 1) archeabacteria, such as 
T Halobacterium. MethanobantArii im Methanoeoccus . Sulfolobales and 
Thermoplasma; 2) eubaeteria, such as gram negative bacteria, e.g.. 
Escherichia coH and strains of Salmonetia and Shigella; gram positive 
bacteria, e.g.. Bacillus subtilis and Staphlococctjs aureus: 3) 
bacteriophages, such as T7, T3, SP6 and N4; 4) DNA viruses; 5) RNA 
1p viruses, such as influenza virus; 6) plants and plant viruses, such as 
yyheat and.turnip rnpsiac virus; and 7) eukaryotic RNA polymerase II 
isolated from fungi, e^g.. Saccharomvnfts rftrbwieg^ an^ higher eukaryotic 
organisms, e^ mammals (e^g^ for a review see In RNA Pohmierase and 
the Regulation of Transcription , Rezinkoff et aL, eds, Elsevier, NY). Also 
15 included for use herein is the replicase from the RNA phage (e^ 
see U.S. Patent Nos. 5,670,353, 5,696,249 and Re: 35,443). 

The selection of the appropriate RNA pblymerase for a nucleic acid 
template to be sequenced is within the skill of the skilled artisan and 
varie? according to the nucleic acid molecule to be sequenced and the 
2b selected promoter region. The selection may be determined empirically 
. , following using teachings known to those of skill in the art, including 
thqse described herein, Preferred nucleic acid molecules to be 
sequenced are DNA molecules. RNA and, particularly, PNA {protein 
nucleic acidis) are also amenable to sequencing, upon selection of an 
25 appropriate RNA polymerase: 

Each nucleic acid promoter-containing probe used In the 
seqijencing methods described herein contains a promoter. The 
promoters used in the methods herein can be obtained from any source 
or can be prepared synthetically based upon sequences recognized by 
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the selected RNA polymerase. For exarripie; the nucleic acid containing a 
promoter may be obtained directly from a i/ariety of different organisms, 
such as bacteria, viruses, phage and eulcaryotic organisms, by cloning or 

^ may be obtained from commercially available e^^^^ (e.g.. 
^ T7^J3, SP6 and a?^«* -prom^^ Bbehringer Mannheim and 

Pharmacia; bja or lac promoters. RSV-LTR promoter arid f 9-1 promoter; 
Stratagene) . The selection of th6 appropriate promoter will depend on 
the nucleic acid to be sequenced, sequencing conditions/ and most 
importantly, on the RNA polymerase selected for trahscription. 

,1,0 Promoters can be produced by hybridizing oligonucleotides 

encoding the coding and non-coding DNA strands of a promoter (see 
Example 4), The separate strarids of nucleic acid are obtained preferably 
by nucleic acid synthetic techniques. Oligonucleotides lised to assemble 
a promoter are designed such that following hybridization, there exists a 

15 single stranded region at the 3'-end of the coding strand to which target 
DMA can hybridize. 

Immobilization of nucleic acid promoter-cbhtaining probes 
In preferred embodiments, the nucleic acid prohloter-containing 
probe is immobilized, directly or by means of a crbss-linking agent, to a 

20 solid support provided herein. The probe can be immobilized prior to 
hybridization or subsequent to hybridizatidh to the target. Preferably and 
most conveniently hybridization is effected following imnidbilization 
the solid support. The probe is preferably immobilized via the ^^5 
the.coding strand or the 3' end of the n6h-cd(ling Strand, 

25 '"Other embodiments, a promoter ahd a doWrtsfr6am target 

nucleic acid can be isolated as a transcriptional lihit from genomic DNA 
by amplification with primers or hybridization cloning technology DNA 
encoding the promoter and target can then be Immobiiized onto a solid 
via the terminal nucleotides of the 5' end of the coding strand (sense 
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. : Strand) or the 3'-end of the non-coding strand. As discussed further 
below; a linking group can be Incorporated into PGR primers used to 
isolate the promoter-containing nucleiic acid template to achieve directed 
immobilization to a support. Use of a suitable RNA polymerase that 
5 recognizes the promoter;^ will in the presehce of chain terminating 
nucleotides will result in a set of nested fragments from which the 
. . sequence can be determined. 

Preferred solid supports are those that can support linkage of 
nucleic acids thereto at high densities, preferably such that the 
10 covalently bound nucleic acids are present on the substrate at a density 
of at least about 20 fmol/mm^, more preferably at least about 75 
fmol/mm^, still more preferably at least about 85 f mol/nim^, yet more 
preferably at least .about 1 00 f mol/mm^/ and most preferably at least 
about 150 fmol/mm^. Among the most preferred substrates for use in 
15 the particular methods of immobilizing nucleic acids to substrates 
provided herein is silicon, whereas less preferred substrates include 
„pplyrneric materials such as;polyacrylamide. Substrates for use in 
methods of producing arrays provided herein include any of a wide 
variety of insoluble support materials including, but not limited to silica 
20. gel, controlled pore glass, cellulose^ glass fiber filters/ glass surfaces, 
, r^^ surfaces (steel,.gold, silYer> aluminum, silicon and copper), plastic 
materials (e^,. of polyethylene, polypropylene, poiyamide, polyvinyldene^ 
. difluoride) and silicon. 

In embodiments of the methods in which a cross-linking reagent is 
25 not employed, a modified nucleic acid is reacted directly with a 

appropriately functionalized surface to yield immobilized nucleic acid. 
Thus, for example, an iodoacetyl-modified surface <or other thiol-reactive 
surface functionality) can react with a thiol-modified nucleic acid to 
provide immobilized nucleic acids. 
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; In embodiments using a cross-linking agent/ the cross-iinking 
is seiebted to provide a high density of nucleic acids immobilized on the 
Insoluble support! The cross-linkina agent (artd other reagents used to 
- : f unctionalize the support suHace or the nucleic acid molecule) can be 
5 selected to provide any desired spacing of the immobilized nucleic acW 
; molecules from the support surface, and to prbvid^ any desired spacing 
of the Immobilized nudeic acids from toch other. Thus, steric 
encumbrance of the nucleic acidmolecules can be reduced or eliminated 
by choice of an appropriate cross-linking agent. In certain embodiments, 
1 p the cross,linking reagent can be selected to provide multiple reactive 
. f ynctlonalities as used in dendrimer synthesis for attachment of multiple 
nticleic acids to a single cross-linking moiety. Preferably, the cross- 
linking agent is selected to be highly raactive with the nucleic acid 
molecule, to provide rapid, complete, and/or selective reac^^^ in 
15 preferred embodiments, the reaction volume of the reagents {e^ the 
thiol group reagent and the thiol-reactive fuhctiohallty) is small. 

^ofiified nucleic acid promoter-contalnlnji prbbte and linkers 
Preferred nucleic acid promoter-con^iriihg probds for use herein 
are "thiol-modified nucleic acids; ^ l:e.v nucieic%c^^^ 
20 contain at least one reactive thiol moiety. As described in further detail 
In Example 1, below, nucleic iacidS coritairilnrat least one reacti thiol 
are preferably made by treating a nucleic acid coritainmg a 3' or 5' 
disulfide with a reducing agent, whfch preferably will hot Compete in 
subsequent reactions, (i.e. will not react whh: ah 
25 . functiphality). Disulfide-derivatized nucjeic acids can be synthesized 
according to a variety of methods. For example, a hucteic acid can be 
modified at the 3'- or 5'-terminus by reactibh with a disulfide containing 
modifying a reagent. Alternatively, a thiolated primer can by 
enzymatically or non- nzymatically attached to th nucleic acid. A 5'- 
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phosphoramidate functionality can also provide an attachment point for a 
thiol or disuiftde-containing cytosine or deoxycytosine. Examples of 
reducing agents appropriate for reduction of a disulfide-modified nucleic 
acid include: tris-(2-carboxyethyl)phosphihe (TCEP) (preferably a 
5 concentration in the range of 1-100 mM (most preferably about 10 mM)) 
is reacted at a pH In the range of 3-6 (most preferably about 4.5), a 
temperature in the range of 20-45®C (most preferably about 37®G) for a 
time period in the range of about 1 to about 10 hrs (most preferably for 
about 5 hrs); dithiothreitol (preferably a concentration in the range of 25 

10 to 100 mM (depen?ling on whether the reactant is isolated) is- reacted at 
a pH in the range of, 6- 10 (most preferably about 8} and iait a temperature 
in the range of 25-45°C (most preferably about dl'^C)) for a time in the 
range of about 1 to about 10 hrs (most preferably about 5 hrs). TCEP 
provides an advantage in the low pH at which It is reactive. This low pH 

15 affectively protonates thiols/ thus suppressing nuclebphi|ic reactions of 
thiols and resulting in fewer side reactions thaii with other disulfide 
reducing agents which are employed at higher pH; • 

As further described in Example 1 , below, a preferred bifuncttonal 
cross-linking agent is N^succinimidyl(4-iodacetyl) aminobenzoate (SIAB). 

20 Other crosslinking agents Include, but are not limited to, dimaleimide, 
dithio-bis-nitrobenzoic lacid (DTNB), N^succinimidyl-S-ac6tyl-thioacetate 
(SATA), N-succinirnidyK3-(2-pyrldyldithiol propionate (SPDP), 
succlnimidyl 4-(N-maleimiddmethyl)cyclohexane-1 -carboxylate (SMCC) 
and 6-hydrazinonicotlmide (HYNIC) may also be used in the novel 

25 process. For further examples of cross-linking reagents, see, elq. . Wong 
. "Chemistry of Protein C onjugation and Gross-Linlcintii " CRC Press 
11991), and Hermahson, " Bioconiuaate Technioues " Academic Press 
(1995). 
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In other embodiments, the nudelc acid is imrnobilized using a 

photocleavable or photolabile linker mojeties that is cleaved during mass 

> spectrometry. Exemplary photolabile cross-linker Include, but are not 

, ^ limfted to, 3-amino-(2-nitrophenyl)propionic acid (Brbvw ej (1995) 

;S Mp|?ffWl9r PlY?r§ftV> PP.4-1 2 and Rothsbhild it akd 996) Nucleic Acids 

Bsa. 24:361.66) See, alsb Ihtematiohal PGT ai30licatioh No. 
(WO 98/20019). . 

A nucleic acid promoter^ontaining probe can be di^^^^^ 
a solid support via a reversible or irreversible bond between an 
m appropriate functionality (f) on the target nucleic acid molecule (T) and 
an appropriate functionality (L) on the capture rnolecule. The fonnat is 
, ?imilactd that shown in U.S. Patent No. 5,503,980. A reversible linkage 
includes linkages that are cleaved under the conditions of mass 
spectrometry. These include, but are not limited to photocleavable 
15 bonds, such as a charge transfer complex, and labile bonds formed 
between relatively stable organic radicals. 

Furthermore, the linkage can be formed With L' being a quaternary 
ammonium group, in which case, the surfacb of the solid support, 
preferably, carries negative charges to repel the ne^ativaly charged 
20 , nucleic acid backbone, thereby facilitating the deS<irptidn required for 
analysis by a mass spectrometer. Desci-ptidn cah ociiw either by the 
heat created by the laser pulse and/or, depending on L,' by specific 
absorption of laser energy which is in resonance with the L' 
chromophore^ 

25 Thus, the^L-L' chemistry can be of a type of disulfide bond 

(chemically cleavable, g^, by mercaptoethahol or dithioerythrol; a 
biptin/streptevidin system; a heterobifunctlonal derivative of a trityl ether 
group; see, Kdster et aL (1990) "A Versatile Acld^tabile Linker for 
Modification of Synthetic Blomolecules," Tetrahedron Utt>.r. .-^i -vnoR^ 
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that can be cleaved under mildly acidic conditions as well as under 
^ conditions of rnass spectrornetry, a lisvuiinyl group cleavabie under 
, alrnost neutral conditions with a hydrazinium/acetate buffer, an 
arginlne-arginine or lysine-lysine bond xleavable by an endopeptidase 
5 enzyme like trypsin or a pyrophosphate bond cleavabie by a pyrophos- 
phatase, or a ribonucleotide bond in between the oligbdeoxynucleotide 
V sequence, which can be cleaved, for example, by a ribonuclease or alkali. 

The functionalities, L and L,' can also form a charge transfer 
complex and thereby form the temporary L-L' linkage. Since in many 
10 cases the "charge-transfer band" can be determined by UV/vis 
spectrometry (see> e.g., Oroanic Charge Transfer Complexes by R. 
Foster, Academic Press, 1969), the laser energy can be tuned to the 
corresponiding energy of the charge-transfer wavelength and, thus, a 
specific desorption off the solid support can be initiated. Those skilled in 
15 the art will recognize that several conribinations can serve this purpose 
and that the donor functionality can be either on the solid support or 
coupled to the nucleic acid molecule to be detected or vice versa. 

In another approach, a reversible L-L' linkage can be generated by 
hoiTiolytically forming relatively stable radicals. Under the influence of 
ZO the laser pulse, desorption (as discussed above) as well as ionization will 
take place at the radical position. Those skilled in the art will recognize 
that other organic radicals can be selected and that, in relation to the 
dissociation energies needed to homolytically cleave the bond between 
them, a corresponding laser wavelength can be selected (see e.g. , 
25 Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). 

As noted, several embodiments of the method involving of 
. immobilization are contemplated herein, th se include, but are not 
limited to embodiments in which: 1) the primer nucleic acid is 
immobilized to the solid support and the target nucleic acid is hybridized 
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thefeto to form a promoter sequence; 2) a ddubie-stranded nucleic acid 
encoding a promoter (amplified or isolated) is immobilized through lihicage 
to one predetermined strand, and in yjjEO transcription is initiated In the 
presence of a predetermined deoxy ribonucleotide, and 3) the target 
5 nucleic acid, which cam be amplified where there is availabid sequence 
Information, Is hybridized to a double-stranded nucleic add capture probe 
with 3' flverhang, and the resulting hybrids are immobilized. 

In the embodiments where the primer nucleic acid is Immobil^^^^^ 
on the solid support and the target nucleic acid Is hybridized thereto, the 
Id inclusion of a cleavable linker permits the primer DNA to be Immobilized 
at the 5'-end so that free 3'-0H Is available for "hybridizing" target DNA 
to the fr^e DNA strand and Initiating transcription to 

Any linker known to those of skill In the art for immobilizing 
nucleic acids to solid supports may be used herein to link the nucleic acid 
15 to a solid support. The preferred linkers herein are the selectively 
cleavable linkers, particularly those exemplified herein. Other linkers 
include, acid cleavable linkers, such as bismaleimideothoxy propane and 
acid-labile trityl linkers. Acid cleavable linkers, photbcleavable and heat 
sensitive linkers may also be used, particulariy where it miy be 
20 necessary to cleave the targeted agent to permit it to be more readily 
accessible to reaction. 

Acid cleavable linkers 

Acid cleavable linkers Include, but are not limited to, bismaleimi- 
deothoxy propane; and adipic acld dlhydrazide linkers (see, fi^ Fattom 
25 fiiflL (1 992) Infection &lmfTTiin, fifi:584-589) and acid labile transferrin 
conjugates that contain a sufficient portion of transferrin to permit entry 
into the intracellular transferrin cycling p&thway (see, &a.. WelhSner et 
aL (1991) J. Biol. Chem. 266 :43Q9-AatA) 
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Photocieavable Linkers. 
-Photpcleavable linkers are imkers that are cleaved upon exposure 
to light, (see, Goldmacher et al. (1992) BiocohL Ghem. 3:104-107). 
thereby releasing the targeted agent upon exposure to light. 
5 Photocleavable linkers that are cleaved upon exposure to light are known 
(see, :e^, Hazum et aL (1 98 1 ) in Pent., Proc. Eur. Pebt. Svmp.. 16th. 
Brtinfeldt,: K (Ed), pp. 105-1 10, which describes the use of a nitrobenzyl 
group as a photocleavable protective group for cysteine; Yen et aL 
(1 989) MakromoL Chem 1 90 :69-82, which describes water soluble 
10 photocleavable copolymers, including hydroxypropylmethacrylamide 
, copolymer, glycine copolymer^ fluorescein copolymer and 
methylrhodamine copolymer; Goldmacher et aL (1992) Bibconi. Chem. 
3:104-107, which describes a cross-linker and reagent that undergoes 
photolytic degradation upon exposure to near UV light (350 nm); and 
15 Senter §t aL (1 985) Photochem. Photobioi 42:231 -237, which describes 
nitrobenzylpxycarbpnyl chloride cross linking reagents that produce 
photpcleavable linkages), thereby releasing the targeted agent upon 
exposure to light. In preferred embbdiments; the nucleic acid is 
imrnobilized using the photocleavable linker moiety that is cleaved during 
20 . mass spectrometry.! 

: Chemjpally cleavable linkers 
A variety of chemically cleavable lirikers rnay be used to introduce 
a cleavable bond between the immobilized nucleic acid and the solid 
support. Acid-labile linkers are presently preferred chemically cleavable 
25 linkers for mass spectrometry, especially MALDI-TOF MS, because the 
acid labile bond is cleaved during conditioning of the nucleic acid upon 
addition of the 3-HPA matrix solution. The acid labile bond can be 
introduc d as a separate linker group, e.g.. the acid labile trityl groups or 
may |?e incorporated in a synthetic nucleic acid linker by introducing one 
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or more sllyl internucleoslde bridges using dilsopropylsilyl. thereby 
forming di|sopropylsilyl-linked biigonudebtide analogs. The 

diisbpropylsilyl bndge replaces the phbshodiester bdnd In the DNA 
backbone ^nd undpr mildly addle cb'ndltloiisi; 8u^^ 1:5% 
5 trrfluoroacetio acid (TFA) or 3^HPA/1 % TFA MALDI.TOF matrix solution, 
results in the introduction of one or more intra-strand breliks in the DNA 
molecule, Method^ for the preparation of dllsopropy^^ 
oligonucleotide precursoi's and analogs are knowri to those of skill in the 
art (see &^ Saha gtaL (1993) j. Qrd. CHarrt ^R.7fl97.-yoQif jhese 
10 oligonucleotide analogs may he readily prepared using solid state 
oligonucleotide synthesis methods using diisopropylsilyl derivatized 
deoxyribonucieosides. . 

Modification of nudeic acids . 
A. Mass modification 

15 In certain ernbodiments, nucleic acids modified at positions other 

than the 3'- or 5'- terminus can be used. Modification of the sugar 
moiety of a nudeotide at positions other than the 3' and 5' position is 
possible through conventional methods. Also, nucleic acid bases can be 
modified, s^, as described in F. Eckstein, ed., " OlioonudeotiriAs ;.pH 

2° Analogues: A Practical ApproBoh IRL Press (1991), Such a linker arm 
can be modified to include a thid moiety. Alternatively, backbone- 
modified nucleic acids can be used so that the thiol group can be 
attached to the nitrogen center provided by the mbdifled^ p 
backbone. 

25 'n Preferred embodiments, modification of a nucleic acid; such as 

described abovel does.not substantially impair the ability of the nudeic 
acid molecule to hybridize to its complement. Thus, any; thbdificatiort 
should preferably avoid substantially modifying the functionalities of the 
nucleic acid that are resporisible for Watson-Crick bas^ pairing. The 



SUBSTITUTE SHEET (RULE 26) 



wo 99^31278 



PCT/US98/26718 



•SB- 
nucleic acid can be modified such that a non-terminal thiol group is 
present^ and the nucleic acid, when immobilized to the support, is 
capable of seif-conriplementary base pairing to form a "hairpin" structure 
having a duplex region. , ^ 

5 B. Other modifying RNA analogs r 

In practicing the methods described herein, a set (or sets) of 
nested base-specific chain terniinated RNA transcripts is (are) generated 
during transcription by the incorporation of a modified base-specific chain 
terminating ribonucleotide analog (or specific cleavage by RNases in a 
10 modified Maxam-Gilbert strategy). Any riboriucleoside triphosphate 
analog that results in the sequence-specific arrest of transcription 
elongation upon incorporation into an RNA; molecule by an RNA 
polyrnerase may be used in the methods herein. Presently preferred 
. ribonucieotide analogs are 3'-deoxyribonucteotides, re^iilt in base- 
15 specific termination of transcription (e:a,. sjee A^lrod gt (1985) 
BiochemistrY 24:5716-5723; Tyagarajan m (1585) Biochemistrv 
30:10920-10924). 

In certain embodiments, in addition to a base^specific chain 
terminqting ribonucleoside triphosphate analog, additional ribonucleotide 
20 . analogs can, be added to reduce the secohdai^y structure of the resulting 
RNA transcript and/or to prevent premature trahscriptioh terrhination or 
pausing or RNA cleayage. For example, the incorporation of riboinosine 
using inosine 5'-triphosphate is known to reduce the secbndary structure 
of RNA products. In the presence of a dinucleotide guanine initiator, 
25 , inosine 5 '-triphosphate can effectively substitute for GTP In in vitro 

transcription. redactions (g^, Axelrod ei §L (1 985) Biochemistry 24:571 6- 
5723). 

In addition, modified ribonuci otide analogs may be added to th 
transcription mixture to increase the efficiency of transcriptional 
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V termlnatipn and/or transcript release to promote and fadilitate the rate 
enzyme turnover; For example, the addition of 4'thl6 UTP, 5-bromo UTP, 
5-iodo CTP alter the hydrogen bonding of the nucleic acid facilitating, at 
least with some RNA polymerases, transcriptional termination and 

5 transcript release. 

Solid: supports and substrates 

Examples of Insoluble supports and substrates for use herein 
include; but are not limited to: beads, sOch as silica gel, controlled pore 
glass, magnetic beads, dextran and agaroise (Sephadex and Sepharose) 

ia beads, celiuiose beads and others, capiiiaries, flat supports such as glass 
, fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, 
sijicqn and copper), plastic materials including muHiwell plates or 
membranes (e^ of polyethylene, polypropylene, polyamide. 
polyvinyidenedifluoride), wafers, combs, pins (e^, arrays of pins 

15 suitable for combiriatorial synthesis or analysis) or beads in pits of flat 

surfaces such as wafers (s^ silicon wafers), with or without filter 
plates. 

jVlass spectrometry 

Once transcription Is complete, the RNA transcript can be 

20 analyzed by any of a variety of means including, for example, 
spectrorpetric techniques such as UV/VIS; IR. flu6res 
chemiluminescence, or NMR spectroscopy, mass sp^^^^^^ gel 
electrophoresis and other methods known in the art, or combinations 
thereof. Mass spectrometry is preferred herein. Mass spectrometer 

25 formats include.the ionization (I) tefehniques, such as niairix assisted 
Jaser desorption (MALDI)> continuous or pulsed, electrbspray (ESI) and 
related methods (g^ lohspray or Thermospray), and niassive cluster 
impact (MCI); these ion sources can be matched with detection formats 
including linear or reflectron time^f-flight (TOF). single oi- multiple 
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quadrupie, single or multiple magnetic sector, Fourier Transform ion 
cyclotron resonance (FTIGR), tbr^ trap, and combinations thereof to yield 
a hybrid detector (£:;g^, ion-trap/time-of -flight). For ionization, numerous 
matrbc/wavelength combinations (MALDI) or solvent combinations (ESI) 
5 can. be employed. 

Preparation of DNA arrays 

in preferred embodiments, nucleic acid promoter-containing 
probes, prior to or subsequent to hybridization to target nucleic acids, are 
immobilized at to the surface of a solid support in an ari-ay format. 
10 Particularly suitable methods for forming thfese DNA arrays are those 
described herein and U;S. application Senal Nos. 08/746^053, 
08/787,639 and 08/786,988. The subject matter of 6atjh pending U.S. 
applications is herein incorporiated by reference in Its eiitirWty and is 
sunrimarized as follows; 
15 Hgure 1 illustrates one systiem for preparing arrays of sa'rtiple 

material for analysis: by a diagnostic tool. Rg. 1 depicts a system 10 
that includes a data processor 12, a motion controller 14, a robotic arm 
assembly 16, a monitor element 18A, a ceritral processing unit 18B, a 
microliter plate of source material 20, a stagfe housing 22, a robotic arm 
20 24, a stage 26,. a piressure cbntrbiler 28; a conduit 30; a mounting 

assembly 32, a pin assembly 38, and substrate elerhents 34. In the view 
shown by Fig. 1, it Is also illustrated that the robotic assembly 16 can 
include a moveable mount element 40 and a horizontal slide groove 42. 
The robotic arm 24 can optionally pivot abbiit a pin 36 to increase the 
25 travel range of the arm 24 so that arrti 24 can disposes the pin assembly 
38 above the source plate 20. 

The data processor 12 depicted in Fig. 1 can be a conventional 
digital data processing system such as an IBM PC compatible computer 
system that is suitable for processing data and for x cutirig program 
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instructions that will provide information for controlling the movement 
and operation of the robotic assembly 1 6. The data processor unit 1 2 
can be any type of system suitable for processing a program of 
instructions signals that will operate the robotic assembly that Is 
5 integrated into the robotic housing 1 6. Optionally the data processor 1 2 
can be a micro-controlled assembly that is integrated Into robotic housing 
1 6. In further alternative embodiments, the systdm 10 need not be 
prpgrarnmabje and can be a single board computer having a firmware 
memory for storing instructions for operating the robotic assembly 1 6. 
^9 V . embodiment depicted in Fig. 1, there is a controller Uthat 

electronically couples between the data processor 1 2 and the robotic 
a?s0mbly16. The depicted controller 14 Is a motion controller that 
drives the motor elements of the robotic assembly 1 6 for positioning the 
robotic arm 24 at a selected location. Additionally, the controller 14 can 
15 provide instructions to the robotic assembly 16 to dii^ect the pressure 
controller 28 to control the volume of fluid ejected from the individual pin 
elements of the depicted pin assembly 38. The design and construction 
Of the depicted motion controller 14 follows from principles well known 
in the art of electrical engineering, and any controller element suitable for 
2Q driving the robotic assembly 1 6 can be practiced without departing from 
the scope thereof . 

The robotic assembly 16 depicted in Fig. 1 electronically couples 
to the controller 14. The depicted robotic assembly 16lsagantry 
system that Includes an XY table for moving the robotic arm about a XY 
25 plane, and further includes a Z axis actuator for moving the robbtii? arm 
orthogonally to that XY plane. The robotic assembly 1 6 depicted in Fig. 
1 Includes an am^ 24 that mounts to the XY stage Which moves the arm 
within a plane defined by the XY access. In the depicted embodiment, 
the XY table is mourned to the Z actuator to move the ntire table along 
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the.Z axis orthogonal to the XY plane. In this way, the. robotic assembly 
provides three degrees of freedom that allows the pin assembly 38 to be 
disposed to any location above the substrates 34 and the source plate 
20 which are shown in Fig. 1 as sitting on the stage 26 mounted to the 
5 robotic assembly 16. 

The depicted robotic assembly 16 follows from principles well 
known in the art of electrical engineering and is just one example of a 
. robotic assembly suitable for moving a pin assembly to locations adjacent 
a substrate and source plate such as the depicted substrate 34. 

10 Accordingly, alternative robotic systems can be practiced following the 
descriptions herein without departing from the scope thereof. 

Fig. 1 depicts an embodiment of a robotic assembly 1 6 that 
includes a pressure controller 28 that connects via a conduit 30 to the 
mount 32 that connects to the pin assembly 38. In this embodiment the 

15 ; mount 32 has an interior channel for fluidic coupling of the conduit 30 to 
the pin assembly 38. Accordingly, the pressure controller 28 is fiuidtcly 
coupled by the conduit 30 and the mount 32 to the pin assembly 38. In 
this way the cpntrolier 14 can send, signals to the pressure controller 28 
to control selectively a fluid pressure delivered to thei pin assembly 38. 

20 Fig. .2 depicts one embodiment of a pin assembly 50 suitable for 

practice with the system depicted In Fig. 1 which includes the pressure 
controller 28. In the depicted embodiment^the pin assembly 50 includes 
a housing formed from an uppeir portion 52 and a lower portion 54 that 
are joined together by the crews 56A and 56B to define ian interior 

25 chamber volume 58. Fig. 2 further depicts that to fluidicly seal the 
interior chamber volume 58 the housing can include a seal element 
depicted in Fig, 2 as an 0-ring gasket 60 that sit s bet^A(een the upper 
block and the lower block 54 and surrounds completely th perimeter of 
the interior chamber volume 58. Fig. 2 further depicts that the pin 
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assembiy 50 includes a pluriality of vesicles 62A-62D/ each of which 
include an axial bore extending theirethrbugh to form the depicted holding 
chambers e4A'-64D. Each of the depicted v^icles extends through a 
■ -r. respective aperture e8A-68D disposed witfiin the lower block 54 of the 
5 housing. 

As further shown in the depicted emhibdiment, each of the vesicles 
62A-62D has an upper flange portion that sits against a seal element 
; : 70A-70D to form a fluid-tight seal between the Veislcle and the lower 
block 54 to prevent fluid from passing through the apertures 68A-68D. 
,10 To keep the seal tight, the depicted pin assembly 50 further includes a 
set of blasing elements 74A-74D depicted in Fig. 2 as springs which, in 
the deleted. embodiments/ are in a compressed state to force the flange 
/ .element of the. vesicles 62A-62D against their respective seal elements 
,i :;;. 70A-70D. As shown in Fig. 2, the biasing elements 74A-74D extend 
15. between the vesicles and the upper block 52. Each of the springs 74A- 
' , 74P can be fixedly mounted to a mounting pad 76A-76D where the 
spring elements can attach to the upper block 52. The upper block 52 
further includes an aperture 78 depicted in Fig. 2 as a centrally disposed 
aperture that includes a threaded bore for receiving a swagelok 80 that 
20 can be rotatably mounted within the aperture 78. 

As further depicted in Fig, 2, the swagelok 80 atteches by a 
conduit to a valve 82 than can connect the swagefok 80 to a conduit 84 
that can be coupled to a pressure source, or alternatively can couple the 
swagelok 80 to a conduit 86 that provides for venting of the interior 
25 chamber 58. A central bora 88 extends through the swegelok 80 and 
couples to the tubing element which further connects to the valve 82 to 
thereby fluidicly and selectively couple the interior chamber volume 58 to 
either a pressure source, or a venting outlet. 
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The pin assembly 50 described above and depicted in Rg. 2 
disposed abpve a substrate element 90 that includes a plurality of wells 
92 that are etched into the upper surface of the substrate 90. As 
illustrated by Fig. 2, the pitch of the vesicles 62A-62D is such that each 
5 vesicle is spaced from the adjacent vesicles by a distance that is an 
Integral multiple of the pitch distance between we|ls 92 etched into the 
upper surface of the substrate 90. As will be seen from the following 
clescription, this spacing facilitates the parallel dispensing of fluid, such 
that fluid can be dispensed into a plurality of wells in a single operation. 
1 Q Each of the vesicles can be made from stainless steel, silica, polymeric 
material or any other material Suitable for holding fluid sample. In one 
example, 16 vesicles are employed in the assembly, which are made of 
V hardened beryllium copper, gold plated over nickel plate. They are 43.2 

mm long and the shaft of the vesicle is graduated to 0;46 mm outer 
15 diameter with a concave tip. Such a pin was chosen since the pointing 
. accuracy can be approximately 501 micrometers. Any suitable pin style 

can be employed for the deylce, including but not limited to flat, star- 
^ shaped, concave, pointed, solid, pointed semi-hollow; angled on one or 
both sides, or otheir such, geometries. ' 
SO Rg. 3 shows from a side perspective the lower block 54 of the pin 

assembly 50 depicted in Rg. 2. Fig. 3 shows approximate dimensions 
for one pin assembly. As shown, the lower block 54 has a bottom plate 
98 and a surrpunding shpulder 100. The bottom plate 98 la 
approximately 3 mm In thickness and the shoulder 100 is approximately 
25 5. mm in thickness. 

Fig, 4 shows from an oyerhaad perspective the general structure 
arid dimensions for one lower bibok 54 suitabi for us with the pin 
assembly for use with the pin assembly 50 shown in Fig. 2. As shown 
in Fig. 4, the lovyer block 54 includes a four-by-four matrix of apertures 
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68 to provide 1 6 apertures each suitable for receiving a vesicle. As 
described above with reference to Fig; 2, the spacing between the 
aperture 68 is typically an integral multiple of the distance between wells 
on a substrate surface as well as the wells of a source plate. 
5 Accordingly, a pin assembly, having the lower bl^^ depicted In 

Fig. 4 can dispense fluid in up tb 1 6 wells simliitaneously. Fig. 4 also 
shows general dimensions of one lower block 54 such that each side of 
block 54 is generally 22 mm in length arid the pitch between aperture 68 
is approximately 4. 5 mm; Such a pitch is suitable for use with a 
IP, substrate where fluid is to be dispensed at locations approximately 500 
/flti apart^ as exemplified by the substrate 90 of Fig. 2. Fig. 4 also 
shows that the lower block 54 can include an optional 0-ring groove 94 
adapted for.Teceiving an 0-ring seal element, such as the seal element 60 
depicted In Fig, 2. It is understood that such a groove element 94 can 
;15 enhance and improve the fluid seal formed by the seal element 60. 

The pinblock can be manufactured of stainless steel as this 
material can be drilled accurately to about +25 /mi, but a variety of 
probe materials can also be used, such as G10 laminat^, PMMA or other 
suitable material. The pin block can contain any number of apertures and 
20 is shown with 16 receptacles which hold the 16 pjns in place. To 

increase the pointing accuracy of ^^ch ^liri, an optional Alignment place 
;ban be placed below the block so that abbut 6 mm of the pin tip is left 
exposed to enable dipping into the wells of a microtit^r pl^e. The layout 
of the probes in the depicted tool Is designed to Cobrdinafe with a 384- 
25 well microtiter plate, thus the center-to-center spacing of the p^bbes in 
4.5 mm. An array . of 4 x 4 probes was chOs^ since it would produce an 
array that would fit in less than one square inch. Which is the travel 
range of an xy stage of a MALDI TOF MS mployed by the assignee. 



SUlBSTITUTE SHEET (RULE 26) 



wo 99/31278 



.PCT/US98/267t8 



-43- 

The pintool assembly is completed with a stainless steel cover on the top 
side of the device which is then attached onto the Z-arm of the robot. 

With reference to Figs 1 and 2, the robotic assembly 1 6 employs a 
-v. pin tool assembly 38 that is configured similaHy as the pin tool asserhbly 
5 $0 depicted in Rgure 2; ;The pressure cohtfdiler 28 selectively controls 
the pressure within chamber 58. With this embodiment, a control 
program operates on the data processor 12 to control the robotic 
assembly 1 6 in a way that the assembly i 6 prints an array of elements 
on the substrates 34. In a first step, (see, also Rgures 2 and 5), the 
10 program directs the robotic assembly 16 to move the pin assembly 38 to 
be disposed, above the source plate 20. The robotic assembly 16 will 
then dip this pin assembly into the source plates 20 Which can be a 384 
well DMA source plate. As shown in Fig. 4 the pin assernbly can include 
1 6 different pins such that the pin assenibly 50 will dip 1 6 pins into 
15 different 16 wells of the 384 well DNA source plate 2(5: Next the data 
prpcesspr 12 will direct the motion controller 14 to operate the robotic 
assembly.! 6 to move the pin assembly to a position abbve the surface of 
the substrate 34.. The substrate 34 can be any substrate suitable for 
l^ceiving a sample of material arid can be formed of silicon, plastic, 
20 metal, or any. other such, suitable material. Qptibhally the substrate will 
have a flat surface, but can alternatively include a pitted surface, a 
surface etched with wells or any other suitable surface typography and 
further containing beads pn the surface of each well. The program 
operating on dal^ processor 12 can then direct the robotic assembly, 
25 through the motion controller 14, tp direct the pressure controller 28 to 
generate a positive pressure within the interior chamber volume 58. In 
this practice, the positive interior pressur will force fluid from the 
holding chambers of vesicles 62 to eject fluid from the vesicles and into 
a respective well 92 of th substrat 90, 
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The program operating on data processor 12 cah 
controller 14 to control the pressure controller 28 to cbntrol filling the 
holding phanribers with source! materiat from the source plate 2 The 
prMsure controller 28 can gehorate a negative pr^ss^^^^ 

"5 interior chamber volume 58 of the pin assembly. This will cause fluid to 
be dra wn up into the holding chambers of the vesic 
pressure controllef 28 can regulate the pressure either by open-loop or 
closed-loop control to avoid having fluid overdrawn thr&ugh the holding 
charnbers and spilled imo the interior chamber volume 58. Loop control 

10 sy«ems for controlling pressure are well khowh in the art and aiiy 
suitable controller can be employed. Such spillage could cause cross- 
cprrt^mlnation, particularly if the source material drawn from the source 
. plate 2Q yan>s fr^ ; , 

In an alternative embodiment, each of the how 
15 64D is sufficiently small to allow the chambers to be filled by capillary 
action, 'n such a practice, the pin assembly can incliide an array of 
narrow bore needles, such as stainless steel needles, that extend through 
the apertures of the lower block 54. The needles that ar^ dipped into 
source solutions will be filled by capiliary action; In one practice, the 
20 length of capillary which is to be filled at atmolpheric pressure is 
detemined approximately by: 

....... ,■ PGR 

where H equals Height; gaitima equals surface tension, P equals solution 

25 dehslty, G eqiialS gfavitat^^^^ forbe and R equals needle radius. Thus 

the volume of fluid heidlby eacK vesible can bo controlled by selecting 

the dimensions of the interior bore, rt 

tehiperature water^ill fiMl& cm length of l60^^>^ capillary. 
Thus, a short bore nahb|it r volume n edie will fill to full capacity, but 
30 should not overflow because the capillary force is understood to be too 
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smalLto,form a meniscus at the top of the needle orifice. This prevents 
cross-contamination due to spillage. In one enibodimeht^ the vesicles of 
the pin assembly can be provided with different sized interior chambers 
for holding and dispensing drfferent volunntes of fluid. 
5 Jn an alternative practice, to decrease the volume' of liquid that is 

: i drawn Into the holding chambers of the vesicles^ a snrtall positive 
pressure can be provided within the interior charhber volume 58 by the 
pressure controller 28. The downward force created by the positive 
pressure can be used to counter the upward capillary force. In this way, 
10 the volume of fluid that Is drawn by capillary force into the holding 
chambers, of the vesicles can be controlled. 

Fig. 5B shows that fluid within the holding chambers of the needle 
can be dispensed by a small positive pressure introduced through the 
central bore 88 extending through a swagelok 80. By regulating the 
.15 pressure pulse that is introduced into the iriteribr chamber volume 58, 
fluid can be ejected either as. a spray or by droplet formation at the 
needle tip. It is understood that the rate of dispensing, droplet versus 
spray, depends in part upon the pressure appliisd by the pressure 
controller 28. In one practice, pressure is 
20 : between- 10 and 1 ,000 Torr of atmospheric pressiire, 

To this end the data processor 12 cart run a comjputer program 
that cpntrols and regulates the volume of fluid disperised. The program 
can direct the controller 28 to eject a defined Volume of fluid, either by 
generating a spray pr by forming a drpp that sits at the end of the 
25 vesicle, and can be contacted with the substrate surface for dispensing 
the fluid, thereto^ 

Figures 5C and 50 show the earlier steps shown in Figs. 5A-5B 
can again be performed, this time at a position on the substrate surface 
that is offset from the earlier position. In the d picted process, the pin 
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15 



20 



25 



tool is offset by a distance equal to the distance between two welis 92. 
Other offset printing techniques can be emplbyed. 

Seyerai advantages of the pin assembly depicted in Fig. 2 are 
achieved. For example, rinsing between dispensing events is straight- 
i forward, requiring only single or multiple pin fillings and emptying events 
With a rinse solution. Moreover, since all holding chambeH; fill to full 
capacity, the accuracy of thervolumes dispensed varies only according to 
.needle inner dimensions which can be carefully corttrolled during pin 
production. Further the device is cost effective; with the greatest 
ejcpense attributed to the needles, however because no contact with a 
surface is required, the needles are exposed to little physical strain or 
stress, making replacement rare and providing long life. 

Alternatively, deposition of sample material onto solid support 
.surface can include techniques that employ pin tool assemblies that have 
solid pin elements extending from a block wherein a robotic assembly 
dip^ the sojid pin elements of the pin assembly into a source of sample 
material to wet the distal ends of the pins with the sample materials. 
Subsequently the robotic assembly can move the pin assembly to a 
. location above the substrate and then lower the pin assembly against the 
surface of the substrate to contact the individual wetted pins against the 
surface for spotting material of the substrate surfece. 

Pin tool assemblies that have solid pins with a promoter-containing 
nucleic acid immobilized thereon Can be dipped into wells containing a 
reaction mixture for hybridization and/or generation of RNA transcript. 
The liquid with the transcripts can then be deposited in an array as 
discussed above, for example by using solid pins of a pin tool assembly 
to spot sample on the substrate surface. 

Figures 6A and 6B depict another alternative system for dispensing 
material on or to the surface of the substrate. In particular. Figure 6A 
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depicts.a jet printing device .1 10 which includes a capillary element 112. 
a transducer elerhent 1 14 and orifice (not shoWn) 1 1 8; a fluid conduit 
122, and a mount 124 connecting to a robotic arm assembly, such as 
the robotic arm 24 depicted in Figure 1 . As further shown in Figure 6A 
5 the jet assembly 1 10 is suitable for ejecting from the orifice 118 a series 
, of drops 120 of a sample material for dispensing sample material onto 
the surface 1 28. 

The capillary 1 12 of the jet assembly 1 10 can be a glass capillary, 
a plastic capillary; or any other suitable housing that can carry a fluid 
1 0 sample and that will aljow the fluid sample to be ejected by th^ action of 
a transducer element/ such as the transducer elerhent 1 14- The 
transducer elehierit 11 4 depicted in Figure CA is ai 'piezd ele^ 
transducer element which forms around the paraihieter of the capillary 
1 12 and can transform an. electrical pulse Vdcieived from the pulse 
1 5 generator within a robotic assembly 1 6 to causie fluid to ^|ect from the 
orifice 11 8 of the capillary 112. One such jet assembly having a 
Piezoelectric transducer element is manufactured by MicroFab 
Technology, Inc., of Germany. Any jet assembly, however, that is 
suitable for dispensing defined and controlled the volumes of fluid can be 
2Q . used herein including those that use piezdelectrib transducers, electric 
transducers, electrorestrictive transducers, maghetorestricJive 
transducers, electrornechariical transducers, or any other suitable 
transducer .element. In )the depicted embodiment, the capillary 112 has a 
fluid conduit 1 22:for receiving fluid material. In an optional embodiment, 
25 fluid can be drawn into the capillary by action of a vacuum pressure that 
will draw fluid through the orifice 1 1 8 when the orifice 11 8 is submerged 
in a source of fluid material; Other embodiments of the jet assembly 110 
can, be practiced with the invention without departing from the scope 
thereof. 
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Figure 6B illustrates a further alternative assembly suitable for p 
being carried on the robotic arm of a robotic assembly, such as the 
assembly 1 6 depicted In Figure 1 . Figure 6B illustrates four jet 
: 1 ^ ^?«emblies connected together, 1 30A-1 30D. Similar to the pin assembly 
5 ,in Figure 2, the jet assembly depicted in Rgure 66 can be employed for 
the parallel dispensing of fluid material. Each of the jet assemblies 1 30A- 
130D can be operated independently of the others, allowing the selective 
dispensing of fluid from select ones of the jet assemblies. Moreover, 
each of the jet assemblies 130A-130D can be independently controlled to 
1P , select the yolume of fluid that is dispensed from each respected one of 
^ J theassernbly 130A-130D. Other modifications and alterations can be 
made to the assembly depicted in Figure 6B without departing from the 
scope pf the irivention. 

Arrays can be formed on a substrate surface according to any of 
15 the techniques discussed above. The sample arrays are then analyzed by 
mass spectrometry to collect spectra data that is representative of the 
composition of the samples in the array. It is understood that the above 
methods provide processes that allow for rapidly dispensirig definite and 
controlled volumes of analyte material. In particular these processes 

20 aHow for dispensing sub to low nanoliter volumes of fluid, these low 
volume deposition techniques, generate; sample array^ well suited for 
analysis by mass spectrometry. For example, the low volumes yield 
reproducibility of spot characteristics, such ai evaporation rates and 
reduced dependence on atmospheric conditions such as ambient 

25 temperature and light. 

Continuing with the example shown in Fig. 5, the arrays can be 
prepared by loading oligonucleotides (0.1-50 ng///lj of diff^ent 
sequenc s or concentrations into the wells of a 96 well microtiter source 
plate 20; the first well can be r served for holding a matrix solution. A 
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substrate 34, such as a pitted silicon chip substrate, can be placed on 
the stage 26 of the robotics assembly 16 and can be aligned manually to 
orient the matrix of wells about a set of reference axes. The control 
program executing ori the data processor 12 can receive the coordinates 
" 0 of the first well of the source plate 20. The robotic arm 24 can dip the 
pin assembly 38 into source plate 20 subh that e^ch of the 16 pins is 
dipped into one of the wells. Each vesicle can fill by capillary action so 
that the full volume of the holding chamber contains fluid. Optionally, 
the program executing on the data processor 12 can dii-ect the pressure 
10 controller to fill the interior chamber 58 of the pin assembly 38 with a 
positive bias pressure that will eouhteract; in part, the force of the 
, capillary action to limit or reduce the volume of fluid that is drawn into 
the holding chamber. 

Optionally* the pin assembly 38 can be difiped into the same 1 6 
15 wells of the source plate 20 and spotted on a second tirget substrate. 
This cycle can be: repeated on as many target substrates as desired. 
Next the robotic arm 24 can dip the pin assembly 38 in a washing 
solution, and;then dip the pin assembly imo 16 different wells of the 
; source plate 20, and spot onto the substra^^^^^^^^ distance 
20 from the initial set of 1 6 spots . Again this cah be repeated for as many 
target substrates as desired. The entire cycle can be repeated to make a 
2x2 array from each vesicle to produce an 8x8 array of spots (2x2 
. elemehts/yesicle x 1 6 vesicles = 64 total elements spotted). Any 
process suitable for forming arrays c^n be employed. 
25 Oligonucleotides of different sequences or concentrations can be 

. loaded into the wells of up to three different 3B4-well microtiter source 
plates; one set of 16 wells can be reserv d for matrix solution. The 
. wells of two plates are filled with washing solution. Fiv microtiter 
plates can be loaded onto the stage of the robotic assembly 1 6. A 
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plurallty of target substrates can be placed abutting an optional set of 
banking or registration pins disposed on the stage 26 and provided for 
aligning the target substrates plong a set of reference axes. If the matrix 
and o|igpnucieotide are not pre-mixed, the pin- assembly can be employed 
' "5 to first spot rnatrix solution on alt desired target subsfrates. in a 

subsequent step the oligonucleotide solution can b« spotted in the same 
pattern as the rnatrix material to re^issolve the rnatrix. Alternatively, a 
sample array can be rnade by placing the oligonucleotide solution on the 
wafer first, followed by the matrix solutibni or by p re-mixing the matrix 
10 . and oligonucleotide: solutlcns, 

After depositing the sample arrays onto^t^^^ 
substrate, the arrays can be analyzed using any of a^^^^^^^^ • 
(&a., spectrometric techniques, such as UVA/IS, IR, fluorescence. 
chemiluminescenc^.NiyiR spectromet^ mass spectrometry. For 
15 exampi^, subsequent to either dispensing process, sample loaded 

substrates .can be placed onto a MALDI-TOF source plate and held there 
with a set of beyeled screw mounted polycarbonate supports. In one 
practice, the plate can be transferred on the end of a probe to be held 
onto a 1/mi resolution, 1- travel xy stage 

20 of a time-of-fllght mass spectrometer. Any suitable mass spectrometry 
tool can be employed. 

Preferred mass spectrometer formats for use with ^^^^^ 
described herein include ionization <l) techniques inbluding but not limited 
to matrix assisted laser desorption (MALDI), contlnudus or pulsed 
25 electrospray (ESI) and related rhethods (e^ lonspray or thermospray), or 
massive cluster Impact (MCI).- those ion sources can be rriatched wrth 
detection formats including, linear or non^lihear reflectron time-of-flight 
(TOP), single or multiple quadruple, single or multiple magn tic sector, 
Fourier Transform ion cyclotron resonanc (FTICR), ion b^ap, and 
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concibinations thereof ( e.g.. ipn-trap/time-of-fiight). For ionization, 
. numerous matri}^wavelengtli combinations (MALDI) or solvent 

combinations (ESI) can be. employed. Subattomole levels of protein have 
been detected for example, using ESI (Valaskovic, G. A. et al., (1996) 
5 Science 273: 11 99-1 202) or MALPI (Li, L. et al . (1996) J. Am. Chem. 
Sfic J^S: 1.662-1663) mass; spectrometry. 

Thus, it will be understood that in processes described herein a 
completely non-contact, high-pressure spray or partial-contact, low 
pressure droplet formation mode can be employed. In the latter, the only 
contact that will occur iS; between the droplet and the walls of the well or 
a hydrpphilic flat surface of the substrate 34. In neither practice need 
.there be any coritact between Jhe needle tip and the surface. 
Exemplary embodiment i 

. In, one preferred embpdiirient, a dpublerstrah^^ 
sequence encoding, a pronjpter sequence is isolated from a natural source 
(SiEL, bacteria, viruses; bacteriophages, plants or eukaryotic organisms) 
or assembled from synthetic sequences^ A sing|e-stranded region of at 
least a 5 nucleotides at the 3'-end of the coding strand using standard 
methods known to those pf skill in the art ie.a.. see Sambrbok et aL, 
(1989) Molecular Cloning,.2nd ed.. Cold: Spring Harbor Labbratory Press, 
New York), this single-^stranded reQ'on is 

complenjeritary to a region of the nucleic acid to be sequenced or to an 
sequence shared between the two nucleic acid molecule^ ( e.g.. a 
restriction ehdonuclease site). 

The nucfejc acid to be sequenced containing at least a partially 
single-stranded 3'-end is hybridized according to the conditions described 
herein, preferably high stringency such that 5 contiguous matching base 
pairs form, and known to those of skill in the art to the complementary 
sequences of the promoter-containing DNA. The nucleic acid to be 
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sequenced may be double-stranded or preferably Sinfgle-stranded. The 
hybridization of the two nucleic acid moiecuies introduces one or more 
"nicks" in the hybrid at the junction(s} of the adjacent nucleic acid 
molecules. Nicks in the coding or nbii-coding strarid, preferably the 
5 coding strand, can be ligated by the addition of an appropriate nucleic 
acid ligase prior to initiating transcription. Methods for ji^ating nucleic 
racids are well known to tiiose of skill in the art {e^, see Sambrook et 
IL. (1989) Molecular Cloning; 2nd ed.. Cold Spring Harbor Laboratory 
, ; Press, New York) and DNA and RNA iigases ere commercially available 
10 (Saai/ Boehringer Mannheim, Indlariapolis, IN). ' 

transcription Is initiated from the promoter by the addition of the 
appropriate RNA polymerase in the presence of ribonucleoside 
triphosphates under conditions described herein and known elsewhere 
, (e^. In RNA Polymerase and tha p^g uiatidh of tran«nr.p t.»n r^,.oI,«<« 
15 et aL, eds, Elsevier, NY). In preferred embodiments, a selected base- 
specific chain terminating 3'-deoxyrib6nucleoside triphosphate and the 
transcription mixture also contains inosine S'-triphosphate to reduce tiie 
secondary structure of the RNA product or modified ribonucleoside 
triphosphates, such as 4-thio UTP, 5-bromo UTP or 5'-iodo OTP to 
20 facilitate turnover of the RNA polymerase enzyme and thereby Increase 
the amount of RNA transcript available for analysis. 

in preferred embodiments, the nucleic acid prdnioter-containing 
probe is oovalently immobilized on a 'silica suppbrt by functionalization of 
the support with an amino functioriallty (e^; by derivatizatipn of the 
25 support with a reagent such as 3-aminopropyi-triethoxysilane (Aldrich 
Chemical Co., Milwaukee; Wl); ^ee Figure 7). Other fuhbtlonalized 
0xy$ilanes or orthosilicates c&h be used, ^nd bre commerelally available 
(S^ from Qelest, Inc., Tullytowri, PA). For example, 3-mercaptopropyl- 
triethoxysilane can be used to functionalize a silicon surface with thiol 
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groups. The amino-functibnalized sifica can then be reacted with a 
heterobtfunctional reagent such as N-succinimidyl (4-iddacetyl) 
aminobenzoate (SIAB) (Pierce, Rockford, IL). Other homo- and hetero- 
bifunctional reagents which can be employed are available commercially, 
5 e.g. . from Pierce. Finally; d nucleic acid functibhalized with 'a thiol group 
( e.Q. , at the 5'-terminus) is covaiently bound to the derivatized silica 
support by reaction of the thiol functionality of the nucleic acid molecule 
with the iodoacetyl functionality of the support. 

In certain embodiments, the nucleic acid can be reacted with the 
10 cross-linking reagent to form a crbss-linker/nucleic acid conjugate, which 
is then reacted with a functionalized support to provide an immobilized 
nucleic acid. . Alternatively, the cross-linker can be combined with the 
nucleic acid and a fur^ctionalized solid support in one pot to provide 
substantially simultaneous reaction of the crbss-Iihkihg reagent with the 
15 nucleic acid and the solid support. In this embodiment, it will generally 
be necessary to use a heterobifunctional cross-llriker; i^e., a cross-linker 
with two different reactive functionalities capable bf selective reaction 
with each of the nuqieic acid and the functionalized solid siupport. 
Following the methods described herein, spaitially-addressable 
Zp. arrays of nucleic acids immobilized on insoluble supports that are suitable 
fpr seflU0n(Ding of nucleic acids using RNA polymerase can be prepared. 
, For example, the.rnethods..can be used to provide arrays of different 
nucleic acids immobilized on pins arranged in an ariray. In another 
embodiment, a phbto-cleavable protecting group on the Insoluble support 
25 can be selectively cleaved (e^, by photoHthbgraphy) to provide portions 
of a surface activated for immobilization of a nucleic acid. For example, 
a silicon surface, modified by treatment with 3-mercaptopropyl- 
triethoxysilane to provide thiol groups, can be blocked with a 
photocleavable protecting group (for examples of phbtocleavable 
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protecting groups, see, e^, PCT Publication WO 92/10092, or McCray 
et aL.(1989) Ann. Rev. BiophvB. Bion^^ys^ ci^^jr. i a.'i^Qimny gnd be 
selectively deblocked by irradiation of selected ardas of tlie surface, §^ 
by use of a photolithography mask. A nucleic acid promoter-containing 
: 5 probe modified to contain a tiiiol^reactive group can then be attached 
> directly to the support, or, altematively. a tiiioNreactive cross-linking 
reagent can be reacted with the thiolHrhodified support, followed by <or 
substantially simultaneously with) reaction with a nucleic acid to provide 
immpbilized nucleic acids. A nucleic acid base or sequence, once 
10 immobilized on a support according to the methods described herein, can 
be furtiier modified, according to known methods. For example, the 
: , .nucleic acid molecule can be lengthened by performing solid-phase 
nucleic acid synthesis according to conventibnal techniqaes, including 
: . pqmb^^ 

15 Preferably tiie nucleic acids are covalently bound to a surface of 

the insoluble support through at least one sulfur atom, he,, the nucleic 
acids are coyalentiy bound to the surface through a linker moiety which 
includes at least one sulfur atom. Such covalentiy bound nucleic acids 
are readily produced by the methods described herein. In preferred 

20 embodiments, the covalently bound nucleic acids are a present on the 
surface qf the. insoluble support at a density of at least about 20 
fmql/mm^, more preferably at least about 75 fmol/itim^, still more 
preferably at least about fmol/mm^ yet more preferably at least about 
1 00 f mol/mm^ and most preferably at least about 1 50 fmol/mm^. 

25 ; Applications. 

The metliibds cf sequencing nucleic acids described herein may be 
used for a variety qf end use applications. For example," the methods 
may be used for diagnostic applications for the identification of 
mutations, such as transitions, transversions, d I tions, insertions and 
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the likis. The rnethpds may also be used to assist in the diagnosis of a 
number of genetic disorders, used for genetic screening or the 
deterrnlnation of heredity or tissue or organ compatibility. 

The method of sequencing may be used for diagnostic applications 
5, to deteriTiined the presence of. genetic alterations in a known target 
. nucleic acid. For example, a region of the target nucleic acid can be 
amplified using standard methods, such as PCR or other amplification 
. rriethoc^s known to those of skill in the art ( e.Q.. see Sambrook et aL, 
(1989) Molecular Cloning, 2nd ed.. Cold Spring Harbor Laboratory Press, 
10 New York), The amplified nucleic;acid can be denatiirated and the strand 
to be sequenced (the noncoding strand, is isolated or may be used as a 
double-stranded molecule. 

The presence of genetic alterations In a known target nucleic is 
determined in a diagnostic application by isolating (or amplifying) a target 
15 nucleic acid from a biologipal sarhple and hybridizing the target to a 
promoter containing capture probe to generate a promoter containing 
template. A nested set of RNA transcripts is generated and the 
. sequence of the target nucleic acid determined and compared with a 
. wildtyp? sequence to determine if a genetic alteration has occurred In the 
?0 target nucleic acid. The presence of asgenetic alteration can be 

determined from the, RNA transcripts by other means known in the art 
(see, e^, U.S. Patent No. 5,605,798 and International POT application 
No. WO 98/20019). 

Methods off Identifying Transcriptional Terminator Sequences 

Methods of identifying heretofore unknown transcriptional 
terminator and attenuator sequences are also provided. Transcriptional 
terminator sequences are responsible for the sequence specific cessation 
pf RNA elongation and such terminator sequences from a variety of 
organisms have been reported. For example, bacterial rho-independent 
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terminator sequences have two inverted repeats separate by several base 
pairs followed by a 5- lO nt polyT stretch. Upon ti^ahscription through 
these sequences, the resulting mRNA transcript fpirms a RNA hairpin 
stem-loop secondary sti-ucture behind the RNA polymerase molecule that 
5 increases pausing of RNA polymerase and/of destabilizes the RNA 
polymerase^DNA template interaction resulting In termination of the 
transcript within the DNA pdlyT stretch; (e^, see Wilson and von Hippel 
(19; Reynolds et al. (1992a) j. Mbl. Bioil. ?yArR^fi?>. Reynolds et aL 
(1992b) J. Mot. Biol. 224:31-51 ; Teleshitsky et aL (1989) Biochemistry 
10 28:5210-5218; d'Aubentbn Carafe et aL (1990) J. Mol. Biol, ^ifirftas- 
858). 

Bacterial rho-dependent terminators lack the traditional inverted 
repeat stem-loop structure and further require additional factors, such as 
rho protein, to halt transcription (e^, see Schmidt M al^ (1984) J. Mol. 
15 BioL 252:15000-15002) . Rho-dependent termination is believed to result 

in the premature termination in bacterial species upon uncoupling of 
transcription and translation. 

By modifying the standard transcription conditions described 
herein, transcriptional terminator sequences, rho-dependent and rho- 

20 independent terminators, may be identified Using mass spectrometric 
methods. In practicing the methods, a sihgle-striirided region of the 3'- 
end of the nucleic acid to be sequenced is hybrldiiied to a'complementary 
sequence at the 3'-end of the coding str&nd a promoter-containing 
nucleic acid probe. In preferred embodiments, the promoter-containing 

25 nucleic acid is covalently coupled via the 5'-6hd of the noncoding strand 
or 3'-end of the coding strand to a solid support and, rrioKe preferably. Is 
a;5'- or 3%thiolated DNA linked at high densities to a aminbsilane-treated 
solid support. The linkage may be in the absence or presence of a linker 
group and is preferably arranged in an array format. 
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Tiranscription is initiated in the absence or presence of modified 
RNA,,triphQsphate analogs that iricrease the efficiency of RNA polymerase 
termination at such terminator sequences/ such as 4-thid UTP, 5-bromo 
UTP or 5 -Tiodo CTP; The mass of the specifically terminated RNA 
.5 transcripts can, be detected by mass spectrometry where the observed 

mass of the RNA is indicative of the location of the terminator-dependent 
; . , artesXc By comparing of the alignment of the sequence 

immediately preceding the site of transcriptional termination from several 
distinct genomic locations, heretofore unknown terminator sequences 
10 may be identified for different RNA polymerases. 

In certain embodiments, nicies in one or more strand resulting from 
the hybridization pf t^ to be sequenced may be ligated by 

the addition of an appropriate nucleic acid ligase prior to initiating 
tra^nscriptiori (Lg^ addlng a DNA or RNA ligdse). 

4t> : The present invention is further iHustra^^ 
Examples, which area intended merely to further illustrate and should not 
be construed as limiting. The entire contents of all the of the references 
(including literatgre references, issued patents, published patent 
r .applications, and cocpending patent applications) cited throughout this 
2p application are hereby expressly ^i^ by reference. 

- . EXAMPLE 1 
ifigh Density AtlBchment of nucleic Acids to Silicon Wafers 
, Materials and Methods 

All reagehtis, unless otherwise noted, were obtained from Aldrich 
25 Chemical, Milwaukee, Wl;, 
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Silicon Surface Pretparation 

Silicon wafers were washed wfth ethandl/ flamed ov^^ 
burnei;, and immersed in an anhydrous solution of 25% (by volume) 3- 
aminopropyltriethoxysilane in toluene for 3 hours. The sllahe solution 
5 was then removed; and the wafers weire washed thrive times with 
toluene and three times with dimethyl sulfoxide (DIVISO). The wafers 
> were then Jncubated ir» a TO rhM anhydrous solution of N-succinimidyl (4- 
ipdoacetyi) anninobenzoate (SIAB) (Pierce ehemieali Roekfbrd, IL) in 
anhydrous DIVtSO. Following the reaistion, the SIAB solution was 
10 removed> and the wafers were washed three tini^ with DMSO. 

, Since it vyras impossible to monitor the condensation of SIAB and 
the .^rnino group while on the solid sufspbrt of the waf^r, the reaction 
was performed in Solution to determine the optimal reaction time. Thin 
layer chromatography (TLC) (glass baclced silica plates with a 254 nm 
1 5 fluorescent indicator) (Baker, Phillipsburg. NF) was employed using 95:5 
chloroform:methanol (Baker, Phillipsburg, NJ) which enabled separation 
of the two starting materials. It was possible to visualize the SIAB 
starting material under long wave ultraviolet light (302 nm); 3- 
aminopropyltriethoxysilane was not active under ultraviolet light, 
20 therefore, the plate was sprayed with a solution of hinhydrin which 
reacts with primary amines to reveal a purple spot Upon heating. A 
microscale reaction was run in chloroforrii/DMSb uSing a lllght^^ 
excess of SIAB in comparison to 3-amin6prbpyltiiethbxysllane and 
monitored with the above mentioned Tic conditions. 
25 Oiigonudeotide Modifications . 

Reduction of the disulfide from 3'- or 5'-disulfide-containihg 
oligodeoxynucleotWes (Operon Technologies, Alameda, CA or Oligo Etc., 
Wilsonville, OR) was monitored using reverse-phase FPLC (Pharmacia. 
Piscataway, NJ); a shift can be seen in the retention time of the 
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oligodeoxynuclebtide upon cleavage of the disulfide. Various reduction 
/ r niethbds were investigated to determine the optimal conditions. In one 
. case; the disulfide-fcontaining oligodebxyntibleotide (31.5 nmol, 0.5 mM) 
) : was^incubated with dithiothreitol fDTT) (Pierce Cherriical; Rockford, IL) 
5. ' (6.2!mmGl; 100 rhM) as pH 8.0 and 37°C. With the cleavage reaction 
■ essentially completerthe free thibl-containihg biigodeoxynucleotide was 
> isolated using a Ghromaspin-10 column (Glontech, Palo Alto, CA) since 
( DTT may compete in the sub3equent reaction. Alternatively, tris-(2- 
. carbdxyethyl) phosphihe (TCEP) (Pierce Ghemical, Rockford, IL) has been 
10 used to cleave the disulfide. The disulfide-cbntaining 

>; . oligodeoxynucleotide (7.2 nmol, 0.36 mM) Was incubated with TCEP in 
pH 4,5 buffer at 37PC. It is not necessWy to isolate^ 
following the reaction since TCEP does not competitively react with the 
todoacetamido functionality. Varying conbehtraiionsi of -TCEP were used 
15 for the cleay^^ determine the optimal conditions for the 

jCpnjugatioh reaction. 
Probe Coupling 

To each wafer which had been derivatized to contain the 
io^pacetarnidp functionality as described abbve was added a 10 mM 

20 r ; aqueous .solytion of the free^thiol containing olijgodeokyriuclebf ide in 100 
mM phosphate buffer/ pH 8; the reaction was allowed to piroceed for a 
minimum of five hours at room teinperatiiref in 1 6o% relative humidity. 
Fplipwing the reaction, the oiigodeoxynucl^btide sojiitibh was removed, 
and the wafers were washed two times in 5 X SSC buffer (75 mM 

25 ; sodium citrate; 750 mM sodium chloride, pH 7) with 50% forniamide 
(USB, Cleveland; OH) at 65®C for 1 hour each. 
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Radiochemicar Determination of Probe Oen?^ 
In order to determine the amount of ONA covalently attached to a 
surface or the amount of a complementary sequence hybridized, 
radiplabeled probes were employed. In cases wheVe a 5'-disulfide- 
5 containing oligodeoxynucleotide was; to be iriimobilized* the 3r-terminus 
waf radiolabeled using ternriinai transferase ehzynie and a radlol^^^ 
dideojcynucleoside triphospliate; in a standard reaction; 15 pmol {0.6;/M) 

of the. S'rdisulfideHjontaining oligodeoxynucleotide was iricubated wi* 
50//Ci (16.5 pnrK)!; .0.667iM) of |ff-32p^4^^ 

10 (ddATP) (Amersham, Arlington Height I L) In the presence of 0.2 mM 2- 
mercaptoethanol. Upon the addition of 40 units of the terminal 
deoxynucleptidyl trar>sferase enzyme (USB, Cleveland, OH), the reaction 
was allowed to proceed for one hour at ST'C. After this time, the 
reaction was stopped by immersion of the vial in TS^C water bath for ten 

15 minMtes, and the product was isolated using a Ghrbmaspin- 10 column 
(Clontech, Palo Alto, CA). Similarly, a 5'-disulfide-containing 
oligodeoxynucleotide was radiolabeled with ^^S. 

In cases where a 3'-disulfide-containing oligodeoxynucleotide was 
. to be immobilized, the 5'-terminus was radiolabeled using T4 

20 .polynucleotide kinase and a radiolabeled nucleoside triphosphate. For 
example, 1 5 pmol (0.6iuM) of the 3'^disulfide^6nfaihing 
,-oligpdeoxynucleptide was incubated with 50//Ci (16.5prn6t, 0.66/iM) of 
W"p) adpnosine-B'triphosphate (ATP) (Amersharn, Ariihgtpn Height, IL) 
. in the presence of 50 mM Tris-HCI, pH 7.6; lO mM MgClj, 10 mM 2- 

25 mercaptoethanoj. Following the addition of 40 units of t4 polynucleotide 
kinase, the reactiori was allowed to proceed for 1 hour at a^C. The 
reaction was stopped by immersion of the vial in a 75°C water bath for 
ten minutes; the product was then isolated using a Chromaspin-10 
column (Clontech, Palo Alto, CA). 
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To determine the density of covalentiy immobiiizeid probe, the 
disulfide-containing oligbdeoxynucleotide of choice was added to a trace 
amount of the same species than had been r^idioiabeled as described 
above. The disulfide was cleaved; the probe was immobilized on 
5 fodoacetamidorfunctionallzed wafers, the wafers were washed, and then 
exposed to a phosphorimager screen (Molecular Dynarriibs, Sunnyvale, 
CA). For each different oligodeoxynucleotide used/ reference spots were 
made on polystyrene in which the molar amount of oligodeoxynucleotide 
was known; these reference spots were exposed to the phosphorimager 
10 screen as well. Upon scanning the screeh/the quantity (in moles) of 
oligodeoxynucleotide bound to each chip was determined by comparing 
the counts to the specific activities of the references. 
Hybridization and Efficiency 

To a wafer that had been fuhctionalized with an immobilized probe 
15 was added a solution of a complementary sequence (10 jiM) in 1 M NaCI 
and TE buffer. The w^afer and solution were heated to 75°G and allowed 
to cool to room ternperature over 3 hours. After this' time/ the solution 
was removedv and the wafer was washed two times with TE buffer. 
To determine, the amount of oligonucleotide hybridized, 
20 immobilization of the probe was first carried out as described above 
except that the probe was labeled with ?®S rather than ^^p. The density 
of immobilizedLprpbe was determined with t Next, 
th^ same -wafer was incubated in TE buffer/ 1M NaCI, ahd its 
complementary strand (IGa/M): which had been radiolabeled with ^^P: 
25 Hybridization was carried put as previously described. Following a wash 
to remove non-specific binding, the wafer and reference were exposed to 
a phosphorimager screen with a piece of copper foil between the screen 
and the wafer. The copper foil serves to block the signal from ^^S, while 
allowing the ^^P signal to pass freely. The molar amount of hybridized 
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oligonucleotide is then determined, thus revealing the percent of 
covalently imrpobilized probe that is available for hybridization. 
MALDI-TOF Mass Spectrometric Analysis 
As described above, wafers containing non-radiolabeled 
: 6 immobilized ollgodeoxynucleotide (name: TCUG; sequence; 

GAATTCGAGCTCGGTACCCGGrmplecular we^ht; 6455 Da; SEQ ID 
. NO. 1) were synthesized, and a complemerrtary sequence (name: 
MJM6; sequence: CCGGGTACCGAGeTCGAATTC; molecular weight: 
6415 Da; SEQ ID NO. 2) Was hybridized; the wafers were washed in 
10 50mM ammonium citrate buffer for cation exchange tb remove sodium 
and pptassiurn ion? oh the DNA backbone (Pielbs. U. etal., (1993) Nud. 
Acidsj^ 21:3191-3196). A matrix solution of 3-hydroxypicolinic acid 
{3-MPA, 0.7 M in 50% acetonitrile, 10% ammbhium citrate; Wu. K.J., et 
aj, (1993) fepid Commun. Mass Spftrtmm 7.1 was spotted 

15 vonto the wafer^and ajlbwed. to dry at ambient temperature. The wafers 
were attached directly to the sample probe of a Finnlgan MAT (Bremen, 
. Germany) Vision 2000 reflectron TOF mass spectrometer using a 
conducting tape. The reflectron possesses a 5 IceV ion source and 20 
.KeVpost-acceleratipn; a nitrogen laser was erhployed; and M spectra 
20. . were taken in the positive ion mode. 
Surface Chemistry 

Employing standard silicon dioxide modification chemistry, a silicon 
wafer was reacted with 3-aminopropyltriethoxysllanb tb produce a 
uniform layer of primary amino groups on the surface. As shown in 
25 Figure 7, the surface was then exposed to a heterobifunctibnai: 
crosslinker resulting in iodoacetamido groups on the surface. It was 
possible to determine the optimal reaction tirrie of this reaction in solution 

using TLC. The SIAB crosslinker was visualiz d under long wave 
ultraviolet light (302 nm) to reveal a spot with an R, value of 0.58. 3- 
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. ; amindpropyltriethoxysilane was not active under ultraviolet light 

* . therefore^ nihhydrtn was used to reveal a purple spot indicating the 

presence of la primary amine at the baseline* A microscaie reaction was 
run using a slight molar excess of SIAB in comparison to 3- 
5 aminopropyltriethoxysHane; TLC analysis after approximately one minute 
revealed a new spot visible under long wave ultraviolet light with an 
yaiue of 0.28. There was ho evidence of a purple spot upon spraying 
with ninhydrin, thus all the 3-aminopropyltrieth6xysilane starting material 
had been consumed in the reaction. UV light also revealed the excess 

10. SIAB vyhich remained following the reaction. From these results, it was 
deterrninedfthe reaction is complete after approximately one minute. In 
, all cases, the iodoacetamido-functionalized wafers were used 
immediately to minimize hydrolysis of the labile iodoacetamido- 
functionality. Additionally, all further wafer manipulations were 

15 performed Jn the dark since the iodoacetamtdo-functtonality is light 
sensitive. . 

Disulfide reduction of the modified oligonucleotide was monitored 
by qbserving a shift in retention time on reverse-phase FPLC. It was 
determined that after five hours in the presence of DTT {ICQ mM) or 

20 TCEP (10 rnM), the disulfide was fully reduced to a free thiol. If the DTT 
reaction was allowed to proceed for a longer time, an oligonucleotide 
dimer formed in which pairs of free thiols had reacted. Such dimerization 
was ;also observed when the DTT was removed following the completion 
of the cleavage reaction. This dimerization was not observed when 

25 TCEP was employed as the. cleavage reagent since this reaction is 

performed at pH 4.5, thus the free thiols were fully protonated inhibiting 
dimerization. 

Irpmediately following disulfide cleavage, the modified 
oligonucleotide was incubated wrth the iodacetamido-functionalized 
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wafers. To ensure complete thiol deprotonatlbn, the coupling reaction 
was performed at pH 8,0. The probe surface deriislty ia&hieyed by this 
chemistry of silicon wafers was analyzed using radiolabeled probes and a 
phosphdrjmager. The probe surface density was also monitored as a 
J S function of the 4 TCEP concentration used in the disulfide ciea^vag^ 

reaction (Figure 8J. ::Uslng 1 0 mM TCEP tb cleave the disulfide and the 
other reaction conditions described above, it was possible to reproducibly 
yield a surface density of 250 fmbi per square mrfi of surface. Identical 
ej^periments as described above were performed except that the 
10 ; oligonucleotide probe lacked a thiol modification; surface densities of less 
than,5 fmol per square mm of surface proved that non-specific binding is 
mininrwl and that probe coupling most likely occurred as proposed in 
.Figure 7.,. ■ . 

.Hybridization 

15 After attaching '^S-labeled probes to the surface of wafers and 

determining conjugation density as described above; hybridization of ^^P- 
. labeled oligonucleotides was carried out; hybridization efficiency and 
density were determined using the phosphorimager and copper foil. It 
was determined experimentally that copper foii blocks 98.4% of an ^'^S 
20 signal, while fully allowing a ^'P signal to be detected. The 

complementary sequence reproducibly hybridized to yield 105 fmcl per 
square mm of surface; this corresponds to approximately 40% of the 
conjugated probes available for hybridization. Similarly, ^ non- 
complementary sequence was employed in this scheme yielding less than 
25^ 5 fmpi per square mm of surface in non-specific binding. 

It is hypothesized that stearic interference between the tightly 
packed oligonucleotide on the flat surface inhibits hybridization 
efficiencies higher that 40%. With this in mind, a spacer molecule was 
incorporated between the tenninus of the hybridizing region of the 
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* oiigonucleotide and the support. The chosen spacers were a series of 
poly, dT sequences ranging in length from 3 to 25. Upon examination of 
these samples with radiolabets and the phosphorimager, it was 
deterrnined that 40% was still the maximum hybridization that could be 
' ns achieved; 

MALpi-TOF MS Ana^^^^^ 

Wafers were f unctionalized With probes;^ complementary 
sequences were.hybridized, and the samples were analyzed under 
standard MALDI conditions as described above. Analysis revealed that 
10 only th^ arinealetJ ;strand (MJM6) was observed in the mass spectrum 
with an experimental mass-tp-charge ratio of 6415.4; the theoretical 
massrto-charge ratio is 6415 (Figure 9). Since there was no signal at a 
mass-to-charge ratio of 6455, it was determined that the wafer- 
, conjugated strand (TCUC) was not desorbed thus the iodoacetamido 

15 linkage was stable enough to withstand the laser and remain Intact. 
There was an additional signal. observed at a mass-to-charge ration of 
6262.0. This signal results from a depurination of guariosines since it is 
known that pNA is susceptible to the loss of purine bases during the 
MALDI process,, (Nordoff, E.> et aL (1992) Rapid Commun. Mass 

20 Spectrom. 6:771-776) . The sample crystals on the wafer were not 
homogenepusly distributed, . thus It was necessary to hunt for a good 
spot . Because of this non-homogeneity, the mass resolution varied, but 
it generally ranged from 200-300 for the desorbed oligonucleotide in the 
m?$s spectra. In one set of experiments, non-complementary sequences 

25 were hybridized to the wafer; following a wash as previously described, 
analysis by MAtDlrTOF MS revealed that minimal non-specific annealing 
had taken place since no signal was detected. 
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EXAMPLE 2 

Preparation of DNA aitays using serial and paralW dfepensIng to^ 

Robot-driven serial ahd pai-allel pL-ilL dispensing tools were used 
to generate 1 0-1 O^ elemerit DNA arrays on < 1 " square chips with flat or 
' ~5 geometrically altered (g^ with wells) surfaces for rtatrix assisted laser 
desorption ionizatibh mass spdctrdnletry analysis; ^ 1^ 
/piezoelectric pipette' (70 /mi id Capillary) di^ipenses single or multiple 
0.2 nL droplets of matrik, and then ahalyte, onto the chip; spectra 
from as low as 0.2 fmol of a 36-mei- DNA have been acquired using this 
iO procedure. Despite the faSt «5 sec) evaporation, micro^ryrtals of 3- 
hydroxypicolinic acid matrix conteining the analyte are routinely produced 
resulting in higher reproducibiiity than rpiitihely obtained with larger 
volume preparations; all of 100 fivefmol spots of a 23-mer in 800 A/m 
wells yielded^ easily interpreted niaijs spectra, with 99/1 00 parent ion 
15 signals having signal to nbise ratio of > 5. In a second approach, probes 
from 384 well microtiter plate are dispensed 16 at a time into chip wells 
or onto flat surfaces using an array of spring loaded pins which transfer 
"20 nL to the chip by surface contact; MS analysis of array elements 
deposited with the parallel method are comparable in terms of sensitivity 
20. and resolution to those made with the serial method. 

Description of the piezoelectric serial disp«nser 
The experimental systern developed from a system purchased from 
Microdrop GmbH. Norderstedt (3ermany and can include a piez 
element driver which sends a pulsed Signal to a pibzoetectric element 
25 bonded to and surrounding a glass capillary which holds the sblution to 
be dispensed; a^pressdre transducer to load (by negative pressure) or 
empty (by positive pressure) the capillary; a robotic xyz stage and robot 
driver to maneuver the capillary for loading, unloading, dispensing, and 
cleaning, a stroboscope and driver pulsed at the frequency of the piezo 
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6lement to enable viewing of 'sii'spisnded' drdplat cheirabteristics; 
separate stages for source dhd designation pieties or sltrfiple targets (i.e. 
Si chip); a Camera mounted to the robotic arm to \/iew loading to 
designation plate; and a data station which controls the pressure unit, 
5 xyz robot, ahd piezoelectric driver. 
. . Description of the paraller dispenser 

The robotic pintool includes 16 prbbes hdusjed in a probe block 
and mounted on an X Y, Z robotic stage. Tfie robotic stage was a gantry 
system which enables the placement of sample trays belbw the arms of 
10 the robot. The gantry unit itself is composed of X and Y arms which 
move 250 and 400 mm; respectively, guided by brushless linear servo 
motoi:s: with positional feedback provided by linear optical A 
lead screw driven Z axis (50 mm vertical travel) is mourited to the xy axis 
slide of the gantry unit and Is cdhtrolled by an In-llnb rotary servo motor 
15 with positional feedback by a motor-mounted rotary optical encoder. The 
work area of the systdni is equipped with k slide^ut tooling plate that 
holds five microtlter plates (most often, 2 plates of wash solution and 3 
plates- of sample for a maximum of 1 1 52 different oliigoniiCleotlde 
solutions) arid up to ten 20x20 mm wafers. The wafers are placed 
20. precisely . in the plate against two banking i)lns and held secure by 

■ vacuumi> The- entire system is enclosed in plexi-glass hous^^^ for safety 
and mounted onto a steel siipplbrt frame for thermal and vibrational 
damping. Motion control is accomplished by employing a commercial 
motion controlier which was a a^axis servo controller and is integrated to 
25 a computer; programming code for specific applications is written as 
needed. 

Samples were dispensed with the serial system onto sev ral 
surfac s which served as targets in the MALDI TOF analysis including (1) 
A flat stainless steel sample target as supplied for routine use in a 
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, Thermo Bioanalysis Vision 2000; (2) the same design stainless steel 
target with micromachined nanopits; (3) flat silicon {Six w 
polished flat Si wafers; (5) Si wafers with rough (3-6 pLrri features) pits; 
(6) (a) 12x12 or m 18x18) rrim Si chips with (a) 10x10 (or (b) 16x16) 
~ -5 arrays of chemically etched wells, eaqh.qOOxSOOl Im on a side with 

depths ranging frpm 99-400 (or(b) 120) micrometer, pitch (a) 1.0 (or(b) 
1 J 25) mm; (7) 15x15 mm Si chips with 28x28 arrays of chemically 
etche^ wells, each 450x450 micrometer on a side with depths ranging 
from 48-300 micrpmeter, pitch 0.5 mm;, (8)flat polycarbonate or other 
10 plastics; (9) flold an^l plher metals; OP) me^^^^^ 

sputtered with gold or other conducting materials. • The dispensed 
volume is controlled frorri 1 p '" to 10:« L by adjusting the number of 
droplets dispensed. 

Sanvple Preparation and Dispensing : 
. . : 'Serial. 

Oligonucleotides (0.1-50 ng/microliter of different sequence or 
concentrations were loaded into wells of a 96 well microtiter plate; the 
first well was reserved for matrix solution. A pitted chip {target 6a in 
MALDI targets; section) was placed pn the stage and aligned manually. 
20 Into the (Windows-based) robot control software were entered the 

coordinates of the first well, the array size {ie nurhber of spots in x and 

y) and spacing between elements, and the number of 0.2 hL drops per 
array element. The capiljary was filled with ~ 1 0 jt/L rinse HjO, 
autpmatically moved In view of a strobe light.illuminated camera for 
25 checking tip integrity and cleanliness while in continuous pulse mode, 
and emptied. The capillary was than filled with matrix solution, again 
checked at the stroboscope, and then used to spot an array onto flat or 
pitted surfaces. For reproducibil^ studies In different MS modes, 
typically a 10x10 array of 0.2-20 nL droplets were dispensed. The 
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capillary was emptied by application of positive pressure, optionally 
rinsed with HjO, and* led to the source oligo plate where of 0*05- 

> . 2.0//M synthetic olijgo were drawn. The capillary was then rastered in 

; series over each of the matrix spots with 0.2-20 nL aqueous solution 
5 added to each. 

. 2. Parallel - ' 

3 Parallel Programs were written to control array making by offset 
printing; to make an array of 64 elements on 1 0 wafers, for example, the 
tool was dipped into 16 wells of a 3 84 well DNA soured plate, moved to 

10 the target (e^fli Si, plastic, rhetal), and the Mmple spotted by surface 
^contact, the tool was then dipped into the same 16 wells and spotted 
•on the second target; this cycle wais riBpeated dn all ien Next 
the tool was dipped in washing solution, then dipped ihtd 16 different 
wells of the source plate, and spotted onto fKe targfet 2.25 mm offset 

15 from the initial; set of 16 Spots; again thiis wai on all 10 wafers; 

the entire cycle was repeated to make a 2x2 ah^ay from each pin to 
produce an 8x8 array of spots (2x2 elements/pin X 16 pins = 64 total 
elements spotted). ' 

To make arrays for MS analysis/ biigbriucl^otides dif different 

20 sequences or concenitrations werie loaded iritoi the wells 6lf up ib three 
different 384-well microtiter plated', one set^ Of 16 Wiells wa^ reserved for 
,,. matrix soiutlpri. The wells of two plates were filled with ^v&shing 
solution. The five microtiter plates were loaded onto the slide-out tooling 
plate. Ten wafers were placed abutting the banking pins on the tooling 

25 plate, and the vacuum turned bri. In cases where matrix and 

oligonucleotide were not pre-mixed, the pihtooi was used to spot matrix 
solution first on all d sired array elements of the ten wafers. For this 
example, a 16 x 16 array was created, thus the tool must spot each of 
the ten wafers 1 6 times, with an offset of i, 1 25 mm. Next, the 
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oligpnucleptide solution was; spotted in the same pattern to re-dissolve 
the matrix. Similarly, an array could be made by placing the 
oligonucleotide solution on the wafer first, followed by the matrix 
solution, or by pre-mbcing the matrix and oligonucleotide solutions. 
5 Mass spectrometry 

Subsequent to either dispensing scheme, loaded chips were held 
, onto a MALDI.TOF source plate with a Set of beveled screw mounted 
, . polycarbonated supports. The, plate was transferred on the end of a 
iprobis to be held onto a 1 /mi: resolution, 1 " travel xy stage (Newport) in 
10 the source region of a time-of-fjjght mass spectrometer. The instrument, 
, . normally operated with 18-26 kV extraction, could be operated In linear 
or curved field reflectrori mode, and in continuous or delayed extraction 
-mode. ;■ 

Serial dispensing with the pte^ 

15 While delivery of a saturated 3HPA solution can result in tip 

clogging as the solvent at the capillary-air interface evaporates, pre- 
mixihg DNA and matrix sufficiently dilutes the matrix such that it remains 
in solution while stable sprays which could be maintained until the 
capillary was eniptied were obtained; with 1 M diluted (in H^O) matrix 

20 solution, continuous spraying for > > 10 minutes was possible. Turning 
off th^ piezo element so that the capillary sat inactive for > 5 minutes, 
. . , Snd reactivating the piezo element also did not result in a clogged 

. capillary. 

Initial experiments using stainless stee} sample targets as provided 
25 by Finnlgan Vision 2000 MALDI-TOF system run in reflectron mode used 
a pre-mixed soliition of the matrix end DNA prior to dispensing onto the 
sample target. In a single microtiter well, 50yi/L saturated matrix solution, 
25^L of a 51A/L solution of the 12-mer (ATCG)3 {SEQ ID No. 3). and 
ZBfjL of a 51//L solution of the 28-mer (ATCG)7 (SEQ ID No. 4) were 
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. rnixed. A set. of. 10x10 arrays of 0.6juL drops was dispensed directly 
onto a Finnigan Vision 2000 sample target disk; IVIALDI^TOF mass 
spectrum was obtained from a single array element wliich contained 750 
attomples of each of the two oligonucleotides; Interpretable mass spectra 
~'$ has. been obtained forDNA molecules as large as a 53-mer (350 amol 
- . loaded, notjshown) using: this method. 

Mass spectra were also obtained from DNA molecules 
microdispensed into the wells of a silicon chip. Figure 10 shows a 
12x12 mm silicon chip with 100 chemically etched wells; mask 
10 dimensions and etch time were set such that fustum (i.e., inverted flat 
top pyramidal) geometry wells with 800x800/mi (top surface) and lOOjtmi 
depth were obtained, Optionally, the. wells can be roughed or pitted. As 
, described above, the chip edge was aligned against a raised surface on 
the stage to define the x and y coordinate systems with respect to the 
15 capillary. (Alternatives include optical alignment, artificial intelligence 
pattern recognition routines, and dowel-pin based rhanual isilignment). 
Into each wesll was dispensed 20 droplets (~5 nU of 3-HPA matrix 
solution without analyte; for the 50% CHjCN solution employed, 
evaporation tirnes for each droplet were on the Order of 5-1 0 seconds. 
Zp Upon solvent evaporation, each microdispensed matrix droplet sis viewed 
under a 1 20X sterepmicroscope generally appeared as ah amorphous and 
'milky' flat disk; such appearances are consistent with. those of droplets 
. from which the Figure 3b spectrum was obtained. Upon tip emptying, 
rinsirtg, and refillirig with a 1.4//m aqueous solution of a 23-mer DNA 
25 (M,(calc) = 6967 Da), the capillary was directed above each of the 1 00 
spots of matrix where 5 nL of the aqueous DNA solution was dispensed 
directly on top of the matrix droplets. Employing visualization via a CCD 
camera, it appeared that the aqueous analyte solution mixed With and re- 
dissolved the matrix (complete evaporation took - 10 sec at ambient 
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temperature and humidity). The amorphous matrix surfaces were 
converted to true micro^crystalline surfaces, with crystalline features on 
the order of < 1 //m. 

Consistent -with the Improved crystattizatiori afforded by the matrix 
5 re-dissolving method, mass spectrum acquisition apfiieared more 

reproducible than with pre-mixed matrbc plus analyte solutions; each of 
the 100 fivefmol spots of the 23-mer yiefded interpreted mass spectra 
(Figure 11), with 99/100 parent ion signals having signal to noise ratios 
r. of > 5; such reproducibility was also obtained with the flat silicon and 
10 metallic surfaces tried (not shown). The Figure 11 spectra were obtained 
on a linear TOF instrument operated at 26 kV. Upon internal calibration 
Of the top left spectrum (well 'k1 ') using the singly and doubly charged 
molecular ions, and application of this calibration file to all other 99 
spectra as an external calibration (Figure 12): a standard deviation of <9 
15 Da from the average molecular weight was obtained, corresponding to a 
relative standard deviation of ~ 0.1%. 

Parallel dispensing with the robotic pintool 

Arrays were made with offset printing as described above. The 
velocity of the X and Y stages are 35 inches/sec, and the velocity of the 
20 Z stage is 5.5 inches/sec. It is possible to move the X arid Y stages at 
maximum velocity to decrease the cycle times, however the speed of the 

Z stage is to be decreased prior to surface contact with the wafer to 
avoid damaging it. At such axes speeds, the approximate cycle time to 
spot 16 elements (one tool impreasiOn of the same solutions) on all ten 
25 wafers is 20 seconds, so to make an array of 556 elements would take 
-5.3 minutes. When placing different oligonucleotide solutions on the 
array, an additional washing step much be incorpordt&d to clean the pin 
tip prior to dipping in another solution, thus the cycle tim would 
increas to 25 seconds or 6.7 minut s to make 1 0 wafers. 
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Sample delivery by the tool was examined using radio-tabeied 
solutions and the phosphorrmager as described previously; it was 
determined that each pin delivers approximately 1 nL of liquid. The spot- 
to-spot reproduclbiiity is high. An array of 256 oligonucleotide elements 
. 5 of varying sequence and cbncehtration Was made oh flat silicon wafers 
using the pintool; and the wafer was ahaiyzied by MALDI-TOF MS. 

EXAMPLE 3 

Use of High Density Nucleic Acid Immobilization to Generate Nucleic Acid 
Arrays 

Employing the high density attachment procedure described in 
EXAMPLE 1, an array of DNA oligomers ahfienable to MALDI-TOF mass 
spectrometry analysis was created on a silicon wafer having a plurality of 
locations, e^, depresisipns or patches, on its surface. To generate the 
arrpy, a free thiol-containing oligpnucleotide prinrier was immobilized only 

15 at the selected locations of the wafer ( e,q. > see Figure 13). Each 
Ideation of the array contained one of three different oligomers. To 
Remonstrate that the different immobilized oligomers could be separately 
detected and distinguished, three distinct oligonucleotides of differing 
lengths that are complementary to one of the three oligomers vyere. 

20 hybridized to the ^rray on the wafer and analyzed by MALDI-TOF mass 
sp;ectrometry. 

Oligodeoxynucieotides . 

Three sets of complementary oligodeoxynucleotide pairs were 
synthesized in which one. member of the complementary oligonucleotide 
25 pair contains a 3'- or SSdistilficie linkage (purchased from Operbn 
Technologies or Oligos, Etc.). For example. Oligomer 1 
(d(CTGATGCGTCGGATCATCTTTTTT-SS); SEQ ID ND: 5) contains a 3'- 
disulfide linkage wher as Oligomer 2 (d(SS- 

CCTCTTGGGAACTGTGTAGTATT); a 5'-disuifide derivative of SEQ ID 
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10 



NO: 6) and Oligomer 3 (dMSS-GAATTCGAGCTCGGTACCCGG); a 5'- 
disulfide derivative of SEQ ID NO: 1) each contain a 5 '.disulfide linkage. 

The oligonucleotides complementary to Oligomers 1-3 were 
designed to be of different lengths that are easily resolvable from orie 
another during MALDI-TOF MS analysis. For example, a 23^mer 
oligonucleotide {SEQ ID NO: 7) was synthesized complementary to a 
portion of Oligomer 1 . a 1 2-mer oligonucleotide (SEQ ID NO: B) was 
synthesized complementary to a bortion of Oligomer 2 and a 21 -mer 
(SEQ ID NO: 2; sequence denoted "IWjJM6" in EXAMPLE 1) was 
synthesized complemeritary to a portion of Oligomer 3. In addition, a 
■ fourth 29-mer oligonucleotide (SEQ ID^ 
complementarity to any of thb three pligoni^rs. This fourth 
oligonucleotide was u^ed as a negative control. 

Silioon surfa<^ cheifhistry w^^^ immobilization 
15 v(a) 4 X 4 (ie-locatibn) array 

A 2 X 2 crn^ silicon wafer having 256 individual depressions or 
vvells in the forrn of a 16 X 16 well array was purchased^^^^f^^^ 
coftimercial supplier (Accelerator technology Corp., College Station. 
Texas ). The wells were SODX 800/;m2, 120 //m deefi, on a 1.125 
pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysilane 
to produce a uniform layer of primary amines on the surface and then 

exposed to the heterobifunctional cfosslihlcer SIAB resulting in 
iodoacetamido functionalities on the surface (e^' see Figure 7). 

To prepare the oligomers for coupling to the various locations of 
the silicon array, the disulfide bond of eacii oligomer was fully reduced 
using 10 mM TCEP as depicted in EXAMPLE 1 , aiid the DNA 
resuspended at a final concentration of 10 ^/M in a solution of 100 mM 
phosphate buffer, pH 8.0. Immbdialely followirig disulfide bond 
reduction, the free-thiol group of the oligomer was coupled to the 



20 



25 



SUBSTITUTE SHEET (Rui^ 26) 



wo 99/31278 



PCT/US98/26718^ 



-75- 

iodoacetamido functionality at 1 6 locations on the wafer using the probe 
coupling conditions essentially as described in Figure 7. To accomplish 
the separate coupling at 16 distinct locations of the wafer, the entire 
: surfaccf of the wafer was not flushed with an oligonucleotide solution 
5 but, instead, an -30-nl aliquot of a predetermined modified oligomer 
was added in parallel to each of 16 locations (i.e., depressions) of the 
256 wells on the wafer to create a 4 x 4 array of immobilized DNA using 
a pin tool as described herein (see e.a.v the Detailed Description and 
Example 4 provided herein). 
10 Thus/ as shown in Figure 13, one of modified Oligomers 1-3 was 

/ covalently immobilized to each of 1 6 separate wells of the 256 wells on 
the silicon wafer thereby creating a 4 x 4 array of immobilized DNA. For 
example. Oligomer 1 was conjugated at a well position in the upper left 
hand corner of the 4 x 4 array and Oligomer 2 was conjugated to the 
15 adjacent location, and so forth. An illustration of the completed array is 
shown in Figure 13. 

In carrying out the hybridization reaction, the three complementary 
pligonucleotides and the negative control oligonucleotide were mixed at a 
final conceritration of 10 //M for each oligonucieotide in 1 ml of TE buffer 
20 {10 mM Tris-HCI, pH 8.0, 1 mM EDTA) supplemented with 1 M NaCI, 
and the solution was heated at 65°C for 10 min. Immediately thereafter, 
the entire surface of the silicon wafer was flushed with 800 /jI of the 
heated oligonucleotide solution. The complementary oligonucleotides 
were annealed to the immobilized oligomers, by incubating the silicon 
25 array at ambient temperature for 1 hr, followed by incubation at 4X for 
at least 1.0 min. Alternatively, the oligonucleotide solution can be added 
to the wafer which is then h ated and allowed to cool for hybridization. 
An Illustration of the complementary oligonucleotides annealed to the 
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specific oligomers covalently inrirtibbiiized at each Ibcatibn is shown in 
Figure 14. 

The hybridized array \ivai5 then washed with a solution of 50 mM 
^ %animonium citrate buffer for cation exchange to remove sodium and 
' 5 potassium Ions on the DNA backbone (Pleles, U. « al., {1993} Nucl. 
AeMsJss. 21:31 91-3196)/ A e^nl aliqu^^^ 

hydroxypicolinic acid (0.7 M 3-hydroxypicblfnlc abld-lO % ammonium 
citrate in 50 % acetbnitrilb; see Wu et a],, Ffaoid Commun Mass 
SfiectroriL 7:142-146 (1993)) was added to each location of the array 

10 using a piezoelectric pipette ais described herein. 

. The solution was allowed to dry at ambient temperature and 
thereafter a 6-nl aliquot of water was added tb each location using a 
piezoelectric pipette to resuspend the dried malrix-DNA complex, such 
that upon drying at ambient temperature the matrix-DNA complex forms 

15 a uniform crystalline surface bn the bottom surface of each location. 
MALDI-TOF MS analysis 

The MALDI-TOF MS «ysis Was performed in series on each of 
the 1 6 locations of the hybridization array Illustrated in Figure 14 
essentially as described in EXAMPLE 1 ; The resultihg mass spectnim-of 

20 oligonucleotides that specifically hybridized to each of the 16 locations of 
the DNA hybridization array Is shown In Figure 15. The mass spectrum 
revealed a specific slgnai at each locatiori representative of observed 
experimental mass-to-chai-ige ratio cbrrespbhding to the specific 
cornplementary nucleotide sequenpe. 

25 For example; in the locatlbhs that have only Ollgbmer 1 conjugated 

thereto, the mass spectrum revealed a jjredomlnate sigrial with an 
observed experimental mass-to-^charge ratio of 707214 approximately 
equal to that of the 23-mer; the thbbretical mass-to-charge ratio of the 
23-mer is 7072.6 Da. Similarly, specific hybridization of the 12-mer 
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oligonucleotide to the array, observed experimental mass-to-charge ratio 
of 361 8.33 Da (theoretical 3622.4 Da), was detedted only at those 
locations conjugated with Oligomer 2 whereas specific hybridization of 
MJM6, (observed experimental massTto-charge ratio of 641 5.4) was 

. 5 detec;ted only at those locations oif the arr^y cbhjugated with Oligomer 3 
(theoretical 6407.2; Da). r . 

None of the locations of the array revealed a signal that 
corresponds to the negative control 29-mer oligonucleotide (theoretical 
mass-^to-charge ratio of 8974.8) indicating that specific target DNA 

10 hiplecules can be hybridized to oligomers cbValently immobilized to 
specific locations on the surface of the silicon array arid a plurality of 
hybridization assays may be individually imohitbred u^ln^ MS 
. ;ana|ysis. ■■ ■ • : 

: (b) 8 X 8 (64-location) array 

""5 A 2 X 2 cm* silicon wafisr having' 256 individual depressio 

pr wells that form a 1 6 X; 16 arraysof wells was purchased from a 
commercial supplier (Accelerator Technology Corp., College Station, 
Texas ). The ynrells were 800 X 800 pm'^ 1 20 //m deep, on a 1 .1 25 
:Pitch. The silicon wafer was reacted^ with 3Tamih6pr<>|iyltrlethoxy8llahe 

20 to produce a uniform, layer of primary amines bri the surface arid then 
exposed to the heterobifunctiona|icr<>ss|infc« 
ibdoacetamido functionalities on th^ surface (e.d.; see Rgure 7) . 

. Following the prbcedurjes described above for the preparation of 
the re-locatlon DNA array. Oligomers 1-3 Were irTimobillzbd to 64 

25 locations forniing an 8 X 8 array on the 256 well s^^^^^ 

hybridized to complementary oligonucleotides arid analyzed by MALDI- 
TOF MS analyisis. Figure 16 shows the mass spectrum bf the 64- 
location DNA array analyzed in series by MALDI-TOF analysis. As shown 
for the 16-location array, specific hybridization of the complementary. 
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oligonucieotide to each of the immobilized thiol-containing oligomers was 
; : , observed in each of the locations of thd DNA array. 

EXAMPLE 4 

.1 BNA transcription of on»-8trand nicked DMA tempiate 

75 IThis example^^^e^^ the effects of coding strand nick 

placement on the efficiency at which RNA transcripts are generated and 
also provides optimal conditions for generating RNA transcripts for 
sequencing. 

1 . Dedgn of Template and Primer Sequences 
"•P All primers were synthesized on a commercially available DNA 

V . .Syii^^^^ Mslna Conventional phosphoroamidite chemistry (Sinha et aL 
(19841 JSlyc^ In vitro RNA transcrlj)tion was 

performed on a synthetic 55 nucleotide double-stranded DNA template. 
The template was assembled using three primer sequences; a 55 
15 nucleotide noncoding strand (SEQ ID No: 10) and two additional primers 
that form the coding strand (SEQ ID Nos: 1 1 and 12). As shown in 
Figure 1 7, the specific position of the nick in tine coding strand at a 
defined by the length of the each of the coding stand primers. Use of 
this construct for transcription demonstrated that a nick at tills position 
20 does not substantially attenuate transcription. 

2. Primer Hybridization and RNA Trahscrlptibn 

One strand nicked templates were produced th^ hybridization of 
the three single-stranded oligonucleotides at a total DNA concentration of 
1 Oa/M J2 fojd excess of primers over template) in 10 rriM MgClj by 
25 heating the. reaction mixture at 70 <»G for 10 minutes and cooling to room 
temperature for at least 4 hours; The position of tiie nick is determined 
by tfie corresponding lengths of tine tvyo coding ^ahd (5'-3') DNA 
oligonucleotides. 
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SP6 FtNA Polymerase Transcription 

in vjtro transcription of the nicked DNA templates was performed 
in 20 lA reactions of 40 mIM Tris-HCI (pH 7.0), 6 rxM MgClz. 2 mlVI 
sperrriidme; 10 mM MaCI, 1 0 ml\A dithibthi'eitol, 1 unit///! RNasin 
5 (Promega). 5 mM rNTP, 5;/Ci Iff-32PjrCTP, 1 unitAt/l SP6 RNA polymerase 
(Amelrsliam, Arlington Heights, 11) at 37*'C for 30 minutes. Abortive and 
full length RNA transcripts were separated by gel electrophoresis arid 
' quaritified by nieasuring the radioactivity of individual RNA fragments by 
drying the polyacrylamide gel and measuring the radioactivity as 

10 compared to a known standard using a Phbspholmager (Molecular 
Dynamics, Inc.). the efficiency oif full-lengtH RNA transcription of a 
nibked DNA template was calculated as a percentage the moles of full 
length RNA transcribed from a DNA ternplate containing no nicks. The 
nick by pass efficiency of a nicked DISIA template was calculated as a 

15 percentage of the moles full-lerigth RNA transcript and the moles of RNA 
trariscript stalled at the nick. 

As illustrated in table 1 , the transcri|3ti6ri of a full-length RNA 
proceeds with 88-94% efficiency when a nick is introduced into the 
coding strand after nucleotides +7. +8, +9 or +19 relative to the start 
.20 of transcription. 

• ■ ■ " - ^ - TABLE 1 



25 





6-mer 
RNA (%) 


Nick-stalied 
RNA (%) 


Full-Length 
^ RNA 1%) 


Nick^Bypass 

Efficiency 

(%) 


Ref. 


320 ±93 




(100) 


100±0 


1N + 7U 


193.2±46.3 


12.45.4 


95.8±6.5 


88.5±5,1 




187.4 ±3919 


5.3±4.2 


87.1 ±4.3 


94.3±0.5 


1N + 9U 


232.7 ±45.6 


10.1±1.1 


65.7±4.4 


87.6 ±1.8 


1N+19U 


279.6 ±33.8 


6.6±0.3 


64.8 ±4.5 


90.9 ±0.4 



SUBSTITUTE SHEET (RULE 26) 



W099/3I278 



PCtA;S98/2671«- 



-80- 



EXAMPLE5 

DNA Sequencing uising T7 RNA Polymerase 

5 1. Design of Template and Priimr Sequences 

- All primers were synthesized on a cornmerciaily available DNA 

Synthesizer using cpnyentlonal phosphoroamldlte chemistry (SInha at ah 
(1 984) Nucleic Add figs. 12:4539). Jn yifta RNA= transcription was 
perfprnried on a synthetic 276 nucleptidie double-stranded DNA template 
10 (SEQ ID No. 13. vyhich includes a^T7. promoter). 
2. DNA Sequencing . , 

DNA sequencing of:a target DNA template was carried out in 10//I 
reactioris Qf 4p miVI Tris-HCI (pH 7.9). 6 mM MgGli, 2 mM neutralized 
spermidine, 5 mM dithiotiireitpl, 300 nM rCTP,^^ 300 nM 

nM rGMP, ipv/CI (a-«P) rCTP or UTP, 10-300, 
generally 10-50 //M 3'^eoxynuGleotide, 0,5-1 .0 pmole linearized or 
supercoiled DNA template, 4 units RNasin .(Promega), 10 unit/;;! T7 RNA 
polymerase (USB) at 37<»C for 30 minutes. Chain termination fragments 
RNA transcripts were separated by gel electrophoresis using 8% or 1 3% 
20 polyacrylamide gel. 

DNA sequencing ladders of RNA tenninated fragment were 
generated up to 180-200 bases. This example shows that T7 RNA 
polymerase can specificaHy iKcorporate modified 3'-deoxyrlbonucleosides 
triphosphates as a base-specific chain terminiator to generate nested RNA 
25 transcripts for sequencing nucleic acids. 

Since modifications will be apparent to those of skill in this art, it 
is intended that this invention be limited only by the scope of the 
appended claims. 
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WHAT IS CLAIMED; 

jl . ..A. method for determining the sequence of a target nucleic 
acid mojecule, comprising: 

a); providing a .nucleic acid molecule comprising a promoter and 
5 target nucleic acid sequence operatively linked to the promoter, wherein 

the nucleic acid nriolecule<is immobilized on a solid support; 
, f .. W *'^a"scribing the promoter-containing nudeic acid molecule wft^ 
an RNA polymerase that recognizes the promoter under conditions 
whereby a nested set of RNA transcripts from the target is produced; 
10 and 

,c) determining the molecular weight values of the transcripts by 
mass spectrometry and thereby determining the nucleic acid sequence of 
the target molecule. 

2. The method of claim 1, wherein the promoter-containing 
15 nucleic acid molecule is immobilized via the 5' end of the coding strand 

or the 3' end of the non-coding strand. • 

3. The method of claim 1, wherein the nucleic acid is DNA or PNA. 

4. The method of any of claims 1 -3/ wherisin the nucleic acid 
molecule conipirising a pronrioter and target IS prepared by: 

, hybridizing a nucleic acid promoter-containing probe having a 
coding and ppncpding strand to a nucleic acid molecuie comprising a 
target nucleic acid sequence, wherein hybridization occurs between a 
single stranded region of the promoter-containing probe and a 
complementary single stranded region at onb end of the molecule 
25 . comprising the target. 

5. The method of claim 4, wherein the molecule comprising the 
target is single-stranded and the portion comprising the complementary 
single-stranded region Is at the 3' end of the target-containing molecul ; 
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and the promoter containing probe mblecule Is dou^^ 
with a single-stranded region of at least 5 nucleotides at the 3' end of 
the coding strand, wherein at least 5 contiguous nubleotides are 
complementary to the single-strartded portion of the target-containing 

. S' .,,mp.lecule*.;; 

. .} Y , 6.; . The.method of claim 4, whei-bin: 

the mplecule comprising the target is dpubie-stranded except for a 
:r singlerstranded region at one arid that comprfees at least 5 contiguous 

nucleptide? complementary to the single-^stranded portibrt of the 
10 prornoter-containing molecule; and 

;v«t sampcend,: relative to the promoter, that the singi^-str^hded region is 
on the target-containing molecule. 

. - . 7. The method of any of claims 4-6, wherein the promoter- 
1 5 containing molecule is immobilized on the solid support prior to 
hybridization of the target-containing molecule. 

8. The rhethod of any of claims 4-6, wherein hybridization is 

, PerformMpriorto immobilization of the promoter-containing molecule on 
the solid support. 

20 , 9- The method of any of diaims 4-8, Wherein nicks In the 

, resulting mpleeules are Jigated prior to tranSd^^^^ 

..polymerase...::-/,' . ■ . . 

, 10. The method of any of claims 1-9, Whbrein the huc^^ 
molecule comprising a promoter and target nucleid geld Is transcribed 
25 with an RNA polymerase to produce a plurality of b^SP-sf)ecificiily 
terminated RNA' transcripts. 

1 1 . The method of any of claims 1-9, Wherein th^ 
molecule comprising a prompter and targ t nucibiP acid Is transcribed 
with an RNA polymerase to produce a set of RNA transcripts that are 
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then specifically cleaved with base-specific ribonucleases to produce a 
set' of cleaved fragments whose molecule weight values are assessed by 
> . mass spectrometry. • • 

1 2. The method of any of claims 1 -1 1 wherein the sequence of 
5 all or a portion of the target nucleic acid is determined by the RNA 
transcripts according to molecular weight. 

/ The method of any of claims 1-12/ wherein the nucleic acid 
molecule comprising the promoter is produced by infimobilizing a single- 
stranded molecule that comprises a promoter or the complement of a 
10 promoter and hybridizing a fragment that comprises the complement 
. ^f;»^r®o^' whereby the resulting molecule comprises a double-stranded 
region cpntaining a promoter and a single-stranded portion of at least 5 
nucleotides. 

1 4. The method of any of claims 1 -1 3, wherein the promoter is 
15 selected fronri the group consisting of phage/eukaivbtic and p 
jpronioters. » 

The fTiethod of claim 14> wherein the promoter Is selected 
from arphaebpcteria, eubacteria> bacteriophages, DNA viruses, RNA 
yiruse^, plants,, p^^ 

20 16. The method of any of claims 1 -15, wherein the RNA 

V polymerase is a DNA-dependeht RNA polymerase or an RNA-dependent 
RNA pplymerase. > - 

17. The method of any of clalrfis 1 -1 6, wherein the RNA 
polymerase is selected from the group consisting of phage, eukaryotic 

25 and pfpkary otic polymerases. 

18. The method of claim 17, wherein the polymerases is 
selected from archaebacteria, eubacteria; bacteriophages, DNA viruses, 
RNA viruses, plants, plant viruses and animal polymerases. 
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19. The method of any cff daims1-1?8i wherein the RNA 
^polymerase is selected from the group consiisting of Escherichia coii RI\IA 
polymerase, T7 RNA polymerase, SP6 RNA polymerase, T3 RNA 
polymerase and replicase. 

5 20. The method of any df claims fil 9/ Wherein prior to 

immobilization of the nucleic acid; the surface of the support is 

. derivatized by reacting the surf ace with ah amiho^ilane to produce 
primary amines on the surface of the sup^^^ 

21. The method of claim 20, Wherein the aminosilane is 3- 
10 amindprppyltriethoxysilahe; 

22. The method of claim 21, further comprising reacting the 
primfry amines on the surface of the supp cross- 
linking reagent to form a thiol-reactive solid support. 

23. The method of claim 22, wherein the thiol-reactive cross- 
15 Jinking reagent is N::Succtnimldyl {4-iodoacetyl) aminobenzoate (SIAB). 

24. The method of claim 22, wherein the imrhobiiization of the 
nucleic acid probe to a solid support is effected by reacting the thiol- 
reactive solid support with a nucleic acid probe having a free 5-' or 3'- 
thiol group, whereby a covalent bond between the thiol group and the 

20 thiol-reactive solid support is formed, 

25. The method of any of claims 1-21, wherein the nucleic acid 
molecule is immobilized to the surface the *olid support at a density of at 

; > , least 20 fmol/mm^ by a covalehl linkage. 

26. The method of any of claims r-26;%h€irein the nucleic acids 
25 are Immobilized on the surface of the solid, support •„ th^ form of an 

, n, ■array.; ■ 

27. The method of any of claims 1 -26, Wli^reiii tfie solid 
support is silicon or silicon Coated. 
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28. The method of any of Claims ,1-27, wherein the surface 
comprises a plurality of wells comprising the immobilized nucleic acid 
moiecule.,. 

29. The method of claim 28; wherein the^'welis have a rough 
" ~"5 Interior surface.. , ,: 

30. The method of claim 28, whereiin the s^^^ 

rough, surface.,,.,,. 

31. Tbp . method of claim 28, wherein the surfaM 
..pitted. . ..... 

10 32. The method of any of claims ^1-31 

wherein the, transcription reactions are performed in the presence of a 
ribonucleoside triphosphate analog, whereby the resulting RNA 
transcripts have reduced secondary structure or the fidelity of termination 
and turnover of the RNA polymerase enzyme is increased compared to 

1 5 RNA transcripts formed from ribonucleoside triphosphatase. 

33. The method of claim 32, wherein the transcription reactions 
include one or more analogs selected from among ihbsine S'-trlphosphate 
(ITP), 4-thib uridine 5 'triphosphate lUTP), 5-bromo Uf P and 5'-iodo CTP. 

34. The method of any of claims 1 -33, wherein the nucleic acid 
20 is immobilized to the solid support via a linker. 

3|. The method of claim 34, wherein the linkage formed by the 
. Iinker,is/ever!si.ble. . . : - 

36. The method of claim 34, wherein the liiikagd formed by the 
linker is irreversible. 

25 37. The method pf claim .34, wherein the linkage formed by the 

linker is photolabiie, acid labile or chemically cleavable. 

38. The method of any of claims 1-37, wher in transcription is 
performed in the pr sence of one or more 3'-deoxyribonucleotides. 
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as. The method Of any of Claims 1-38/ further com 
adding a matrix material for mass spectrometry to the surface of 
the support before or after immobilization of the nucleic acid, 
.4P* = The method of any of claims 1-39, Wherein the RNA 
' 5 transcripts are conditioned prior to performing mass spectrometry. 

41 . The method of any of claims 1-4, wherein the hybridization 
of the nucleic acid containing the target sequence to the promoter- 
containing molecule results in the formation of a niclc in the coding strand 
corresponding to positions beyond +6 relative to the start of 

10 transcription froro the promoter. 

42. , The method of claim 24, wherein the nick is at position 
; ; +7i + 8,:+ 9 or +19. 

43. The method of claim 1 , comprising: 

a) immobilizing a nucleic acid promoter-containing probe on a 
15 solid support, wherein the nucleic acid promoter-containing probe 

comprises at least 6 nucleotides at the 3'-end of the coding strand that is 
complementary to a single-stranded region at the 3'-end of the target 
nucleic acid; 

, b) hybridizing the nucleic acid to be sequenced to the 
20 immobilized nucleic acid probe; 

c) transcribing the target nucleic acid with an RNA polymerase 
to produce a plurality of base-specifically terminated RNA transcripts, 
wherein; the RNA polymerase 

d) determining the molecular weight value Of each 

25 base-specifically terminated RNA transcript by mass spectrometry. 

44, A method of identifying transcriptional terminator sequences 
or^attenuatpr sequences; comprising: 

a) - immobilizing a nucleic acid promoter-containing probe on a 
solid support, wherein th nucleic acid promoter-containing probe 
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comprises at least 5 nucleotides at the S'-end of the coding strand that Is 
complementary to a single stranded region at the 3'-end of the target 
nucleic acid; 

b) hybridizing the nucleic acid to be sequenced to the 
'5 immobilized nucleic acid probe; 

c) transcribing the targist nucleic acid with an RNA polymerase 
to produce a sequence-terminated RNA transcript, wherein the RNA 
polymerase recognizes the promoter; and 

d) determining the molecular weight value of the RNA 

0 transcript by mass spectrometry, whereby the terminator sequence or 
attenuator is identified. 
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1024 WELL (32x32) S CHIP 

FIG. 5 A 




1024 WELL (32x32) Si CHIP 



FIG. 5B 
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FIG. 5G 




V024 WELL (32x32) Si CHIP 

FIG. 5D 
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(1) GENERAL INFORMATION 

(i) APPLICANT: 

(A) NAME: SEQUENOM, INC. 

(B) STREET: 11555 Sorrento Valley Road 

(C) CITY: San Diego 
— *^ (D) STATE: CcQifornia 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 92121 

(i) APPLICANT: 

(A) NAME: KOREA ADVANCED INSTITUTE OF SCIENCE AND TECHNOLOGY 

(B) STREET: 373-1 Kusong-dong 

(C) CITY: Taejon 305-701 
{D) STATE: Yusong-gu 

(E) COUNTRY: Republic of Korea 

(F) PdSTAL CODE (ZIP): 92121 

(i) INVENTOR 

(A) NAME: Changwon Kang 

(B) STREET: Expo i^t . 305-401, 464-1 Jeonmin-dong 

(C) CITY: Taejon 305-390 . 

(D) STATE: Yusong-gu 

(D) COUNTRY: REPUBLIC OF KOREA 

(E) POSTAL CODE (ZIP) : . 

(i) INVENTOR 

(A) NAME: Young -Soo Kwon 

(B) STREET: 25-2 Mangwol-dong 

(C) CITY: Kwangju 500-500 

(D) STATE: Puk-gu 

(D) COUNTRY: Republic of Korea 

(E) POSTAL CODE (ZIP) : 



(i) INVENTOR: 

(A) NAME: Young Tae Kim 

(B) STREET: Sungwon Apt. 2-204, 162 Songpa-'2-dong 

(C) CITY: Seoul 138-172 
(15) STATE: Songpa-gu . 

(D) COUNTRY: REPUBLIC 

(E) POSTAL CODE (ZIP) : 



(i) INVENTOR: 

(A) NAME: Hubert Koster 

(B) STREET: 836 Via Mallorca Drive 

(C) CITY: La Jolla 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92037 . 

(i) INVENTOR: 

(A) NAME: Daniel P.. Little • 

(B) STREET: 393 Glendale Lake.Rd. 

(C) CITY: Patton 

(D) STATE : Pennsylvania 
(D) COUNTRY: USA 

(£) POSTAL CODE (ZIP) : 18668 
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(i) INVENTOR: 

(A) NAME: Maryanne O'Donnell 
<B) STREET: 
<C) CITY: 
(D) STATE: 

(D) COUNTRY: 

(E) POSTAL CODE (ZIP) : 

' (i) INVENTOR: • 

(A) NAME: Goubing Xiang 

(B) STREET: 11381 Zapata Ave - 

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY: tJSA" 

(E) POSTAL CODE (ZIP): 92126 



(i) INVENTOR: 

(A) NAME : Charles Cantor 

(B) STREET: 

(C) CITY: 

(D) STATE: California 

(D) COUNTRY: : USA 

(E) POSTAL CODE (ZIP) : 

(i) INVENTOR: 

(A) NAME: David Lough 

(B) STREET: 

(C) CITY: 

(D) STATE : 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 



(ii) TITLE OF INVENTION: MASS SPECTROMETRIC METHODS FOR SEQUENCING 
NUCLEIC ACIDS 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Heller Ehrman White & MeAuliffe 

(B) STREET: 4250 Executive Square, 7th Floor 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPXTTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible . ' 

(C) OPERATING, SYSTEM: DOS 

(D) SOFTWARE: ASCII 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 
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(A) APPLICATION NUMBER: 08/990,851- 

(B) FILING DATE: 15 -DEC- 1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Seidman, Stephanie L 

(B) REGISTRATION NUMBER: 33,779 

(C) RBFBRENCE/DOCKET NUMBER: 24736-2007 



(ix) TELECOMMUNICATION INFORMATION: / 

(A) TELEPHONE: 619-450-8400 . 

(B) TELEFAX: 619-450-8499 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: cDNA • 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO " 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAATTCGAGC TCGGTACCCG G ' 21 ; . 

(2) INFORMATION FOR SEQ ID N0.:2: 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO . . 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:.2: 
CCGGGTACOG AGCTCGAATT C . . 21 

(2) INFORMATION FOR SEQ ID N0:3:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: \in3cnovm 

(D) TOPOLOGY: un)uiovni 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE:. 

(vi) OiUQINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
ATCGATCGAT CG 

_ (2) INFORMATION FOR SEQ ID Nb:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOI^OLOGY: imknowh 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: S£Q XD N0:4: 
ATCGATCGAT CGATCiSATCG ATCGATCG 

(2) INFORMATION FOR SEQ ID H0s5: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTGATGCGTC GGATCATCTT TTTT 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 baise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CCTCTTGGGA ACT(3TGTAGT ATT 



23 
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(2) IMFORMATION FOR SBQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 23 base pairs • -r 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: luikziOwnT :; 

— (ii) .MOLBCUI-B TYPE: cDNA 

(ili) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: : • : 
(xi) SEQUENCE DESOIIPTION: SEQ ID NO:?: 

GATGATCCGA CGCATCAGAA TGT 22 
(2) INFORMATION FOR SEQ ID NOtS: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICTUit NO 

(iv) ANTISENSE: NO : . 

(v) FRAGMENT TYPE:. 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 8: 
AATACTACAC AG ... ,12 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: uxiknown 

(D) TOPOIjOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

. (iv) ANTISENSE: NO , • - 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQXiENCE DESCRIPTION: SEQ ,ip .N0:.9 : . 
GATCTAGCTG GGCCGAGCTA ^GGCCGTTGA 29 - 

(2) INFORMATION FOR SEQ ID NO: 10: . . . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs •; *; 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO . . 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATJ83ACGCTG CTGGACGGCA CCCTTCTCCA AGACTTGTAT AGTGTCACCT AAATC 55 
(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
(5ATTTAGGTG ACACTATA6A AGTCT . ;25 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: xinknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGGAGAAGGG TGCCGTCCAG CAGCGTCTAT 30 
(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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GCTCTAATAC GACTCACTAT AGGGAGACAA 
ATCCCCGGGT ACCGAGCTCG AATTCCTGGC 
GCCGTTGCTT CGCAACGTTC AAATCCGCGT 
TCACCCGACA AACAACAGAT AAAAAAAAA6 
TtTGATGCCT GGGAATTCGT ATTCTATTCT 



GCTTGGATGC CTGCAGGTGG ACTCTAGAGG 60 
AGrrXATGGC GGGC6TCTGC CACCGTCCGG 120 
CCGGCGGATT TGTCCTACTC AGGAGAGCGT 180 
CCCAGTCTTT CGACTGGGCC TTTCGTTTTA 240 
ATAGTG . . . , 276 
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